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1  Summary  of  Objectives  Sz  Accomplishements 

1.1  Objectives 

The  long-range  goal  of  this  research  program  is  to  form  a  new  system  identification  paradigm  that 
fulfills  all  the  requirements  of  robust  control  design,  j.e.,  to  produce  from  finite  measured  data  a 
model  which  includes  descriptors  for  both  dynamic  and  disturbance  uncertainty. 


1.2  Accomplishments/New  Findings 

Several  directions  were  pursued  during  the  reporting  period:  (1)  least-squares  related  approaches 
were  further  investigated,  (2)  ‘windsurfer’  adaptation  was  completed,  and  (3)  uncertainty  model 
unfalsification  was  developed.  The  latter,  together  with  some  of  the  ideas  in  ‘windsurfer’  adaptation, 
marks  a  significant  advance  towards  attaining  the  program  objectives.  The  new  idea  is  the  use  of 
unfalsification  in  the  adaptive  control  framework.  The  main  ideas  are  described  next. 

Inspired  by  the  work  of  PooUa  et  a/.[ll],  we  have  shown  that  unfalsification  of  the  standard 
robust  control  design  uncertainty  model  is  the  natural  replacement  for  system  identification  when 
the  intended  use  of  the  model  is  robust  control.  For  any  affine  prediction  error  model,  such  as  an 
ARX  model,  the  unfaJsification  step  requires  solving  a  set  of  convex  programming  problems,  specif¬ 
ically  LMI  problems,  of  which  ordinary  least-squares  is  one  member.  The  result  is  an  uncertainty 
tradeoff  curve  between  model  dynamic  and  disturbance  uncertainty.  Hence,  a  family  of  models 
are  unfalsified  from  the  data  record.  More  specifically: 

•  For  affine  models  the  unfalsification  step  is  an  LMI  problem  with  a  very  specific  structure 
which  allows  very  rapid  computations  using  FFTs.  This  makes  real-time  implementation 
feasible. 

•  The  unfalsification  step  is  downward  compatible  with  existing  methods  of  identification,  i.e., 
one  point  on  the  tradeoff  curve  is  identical  to  classical  least-squares  prediction  error  identifi¬ 
cation. 

•  Since  aU  uncertainty  models  along  the  tradeoff  curve  are  equally  unfalsified,  or  valid,  we  are  led 
to  a  natural  method  of  model  selection  via  closed-loop  implementation  and  design  iteration. 
Specifically,  to  each  uncertainty  model  on  the  tradeoff  curve  a  robust  controller  is  designed 
and  implemented  on  the  actual  system.  The  process  starts  with  a  cautious  controller  based 
on  a  large  value  of  the  dynamic  uncertainty  bound.  The  bound  is  gradually  decreased,  and 
the  controller  made  more  aggressive.  Actual  closed-loop  data  is  used  to  evaluate  (unfalsify  or 
falsify)  the  controller.  The  process  continues  as  long  as  the  actual  performance  is  better  than 
that  predicted  by  the  corresponding  uncertainty  model  and  worse  than  that  desired.  If  the 
desired  performance  is  achieved,  then  the  design  is  complete.  If  performance  is  worse  than 
predicted  but  not  as  good  as  desired,  then  the  uncertainty  model  has  been  falsified.  This  new 
data  can  then  be  used  in  the  next  iteration  to  obtain  a  new  tradeoff  curve,  i.e.,  a  new  family 
of  unfalsified  uncertainty  models. 

The  methodology  is  extensible  to  many  forms  of  nonlinear  systems,  but  this  has  not  been  thoroughly 
explored. 
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1.3  Potential  Applications 


Since  system  identification  enjoys  a  widespread  usage  in  the  engineering  community,  it  is  antici¬ 
pated  that  unfalsification  methods  will  be  a  natural,  desireable,  and  in  some  cases,  an  indispensable 
enhancement  to  existing  software  tools.  In  particular,  the  aerospace  community  is  probably  the 
largest  user  of  system  identification  and  control  design,  and  as  a  result  a  downward  compatible 
replacement  which  is  smoothly  compatible  with  robust  control  design  would  be  much  anticipated. 

A  very  important  specific  application  is  adaptive  flight  control  or  adaptive  reconfigurable  con¬ 
trol.  In  this  case  it  is  extremely  important  to  have  a  design  model  which  captures  uncertainty, 
because  the  reconfigured  controller  is  working  in  possibly  a  very  hostile  environment.  In  some 
cases  the  control  objective  is  to  maximize  performance  of  a  damaged  aircraft.  This  is  a  specific 
case  where  unfalsification  of  an  uncertainty  model  would  have  a  significant  impact  to  Air  Force 
missions. 

There  are  also  potential  applications  in  civilian  technology,  in  particular,  semi-conductor  man¬ 
ufacturing,  paper  mills,  and  other  manufacturing  environments  which  involve  high  performance 
requirements  of  sometimes  very  uncertain  nonlinear  systems. 


2  Personnel  Supported 

Prof.  Stephen  Boyd  of  Stanford  University  has  been  a  consultant  on  the  project. 

Prof.  Anderson  of  the  Australian  National  University  in  Canberra,  Australia  has  collaborated 
with  Dr.  Kosut  on  a  number  of  occasions  in  the  past.  This  year  Prof.  Anderson  has  visited  Dr. 
Kosut  twice  at  ISI.  Prof.  Anderson’s  is  not  supported  under  this  contract. 

Mr.  Wee-Sit  Lee,  a  student  of  Prof.  Anderson’s  received  his  Ph.  D.  this  year.  Mr.  Lee  was 
not  directly  supported  under  this  contract,  but  under  the  guidance  of  Prof.  Anderson  and  Dr. 
Kosut,  Mr.  Lee  worked  on  a  method  of  adaptive  control  and  learning,  referred  to  as  “windsurfer 
adaptation,”  which  has  formed  the  basis  for  the  unfalsification  process  described  above. 


3  Publications 

Under  this  contract,  Dr.  Kosut  was  an  author  or  co-author  of  the  publications  in  the  list  to  follow. 

Copies  of  all  of  these  publications  are  included  in  the  Appendix. 

1.  R.L.  Kosut  and  B.D.O.  Anderson,  “Statistical  analysis  of  least-squares  identification  for  ro¬ 
bust  control  design:  output  error  case  with  affine  paramtrization,”  Proc.  1993  ACC,  San 
Francisco,  CA,  June  1993. 

2.  W.S.  Lee,  B.D.O.  Anderson,  R.L.  Kosut,  and  LM.Y.  Mareels,  “A  new  approach  to  adaptive 
robust  control,”  Int.J.  Adaptive  Control  and  Signal  Processing^  vol.  7,  no.  3,  pp.  183-212, 
May-June  1993. 

3.  H.  Aling  and  R.L.  Kosut,  “Unbiased  least-squares  estimates  with  structure  incompatibilities,” 
Proc.  1993  ECC.)  Groningen,  The  Netherlands,  29  June-2  July,  1993. 
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4.  W.S.  Lee,  B.D.O.  Anderson,  I.M.Y.  Mareels,  and  R.L.  Kosut,  “On  some  key  issues  in  the 
windsurfer  approach  to  adaptive  control,”  Automatica,  Vol.  31,  No.  11,  pp. 1619- 1636,  1995. 
Appeared  first  in  Proc.  SYSID  94,  10th  IFAC  Symposium  on  System  Identification,  Copen¬ 
hagen,  Denmark,  4-6  July,  1994. 

5.  M.  Massoumnia  and  R.L.  Kosut,  “A  family  of  norms  for  system  identification  problems,” 
IEEE  Trans,  on  Automatic  Control,  vol.  39,  no.  5,  May  1994,  pp. 1027-1031.  Appeared  first 
in  Proc.  1993  ACC,  San  Francisco,  CA,  June  1993. 

6.  R.L.  Kosut  and  B.D.O.  Anderson,  “Least-squares  parameter  set  estimation  for  robust  control 
design,”  Proc.  1994  ACC,  Baltimore,  MD,  29  June-1  July,  1994. 

7.  R.L.  Kosut,  “Uncertainty  model  unfalsification:  a  new  system  identification  paradigm  com¬ 
patible  with  robust  control  design,”  Proc.  1995  IEEE  CDC,  New  Orleans,  LA,  Dec.  1995. 

8.  R.L.  Kosut,  “Iterative  adaptive  control  via  uncertainty  model  unfalsification,”  invited  session, 
1996  IFAC  World  Congress,  San  Francisco,  CA,  June  1996. 


4  Interact  ions /Transit  ions 

4.1  Interactions 

1.  Dr.  Kosut  gave  a  plenary  lecture  at  the  14th  Benelux  Meeting  on  Systems  and  Control  in 
Houthalen,  Belgium,  March  29-31,  1995. 

2.  Dr.  Kosut  was  invited  to  participate  in  an  NSF/YaJe  sponsered  workshop  on  “Control  using 
Logic  Based  Switching,”  which  is  another  potential  applications  area  for  unfalsified  uncer¬ 
tainty  modeling. 

3.  Dr.  Kosut  gave  a  tutorial  lecture  on  adaptive  control  at  the  forthcoming  ISA  95  industrial 
computing  society  conference  in  New  Orleans,  Oct.  1-6. 

4.  Dr.  Kosut,  Prof.  Gevers  (Louvain  University,  Belgium),  Prof.  Bitmead  (Autralian  Na¬ 
tional  University),  and  Prof.  Smith  (UC/Santa  Barbara)  will  give  a  tutorial  workshop  at  the 
forthcoming  1996  IFAC  World  Congress  in  San  Francisco  on  the  interplay  between  system 
identification  and  robust  control  design. 

4.2  Transitions 

Since  the  conceptual  framework  leading  to  the  unfalsified  uncertainty  modeling  and  tradeoff  curve 
has  only  recently  been  developed,  no  transitions  have  taken  place.  It  is  anticipated  that  this  wiU 
occur,  but  first  it  is  crucial  to  develop  reliable  and  efficient  software  which  can  handle  typical  data 
sets,  which  can  be  quite  large.  Moreover,  for  practical  use,  these  tools  must  be  smoothly  linked 
with  current  robust  control  design  tools. 


5  Inventions,  or  Patent  Disclosures 

None. 
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Expanded  Discussion 


6.1  Assessment  of  State-of-the-Art 

Robust  control  design  and  system  identification  have  each  reached  a  level  of  acceptance  by  the 
engineering  community  because  of  their  sound  theoretical  foundations  and  the  wide  availability  of 
software  tools  based  on  reliable  computational  algorithms.  However,  these  two  methodologies  do 
not  have  a  common  basis,  and  are  at  the  least,  philosophically  incompatible. 

As  described  by  Ljung  [10],  traditional  transfer  function  identification  postulates  an  uncertainty 
model  consisting  of  unknown  transfer  function  coefficients  and  a  random  exogenous  disturbance  with 
an  unknown  variance.  The  unknown  parameters  are  to  be  estimated  from  a  finite  input/ output  data 
record.  In  contrast,  the  standard  uncertainty  model  for  robust  control  design  is  more  realistic  in 
that  it  includes  additional  parameters  for  describing  dynamic  uncertainty  bounds,  thereby  capturing 
model  limitations. 

The  focus  of  this  research  effort  is  to  develop  a  quantitative  methodology  for  simultaneously  es¬ 
timating  the  transfer  function  parameters,  the  noise  variance,  and  the  dynamic  uncertainty  bounds 
from  finite-data.  The  need  for  this  methodology  arises  in  many  cases  where  adaptive  control  could 
achieve  a  high  performance  capability  otherwise  unobtainable. 


6.2  Unfalsification 

The  thesis  of  this  research  program  is  that  traditional  system  identification  must  be  modified  in 
order  to  be  compatible  with  robust  control  design.  The  essence  of  the  modification  is  to  replace 
system  identification  of  the  traditional  uncertainty  model  with  validation  of  the  uncertainty  model 
used  for  robust  control  design.  In  this  context  validation  is  defined  as  follows: 

•  Given  some  data,  a  model  is  said  to  be  validated  if  and  only  if  it  could  have  produced  the 
data. 

As  discussed  by  many  philosophers  pondering  the  basis  of  scientific  knowledge,  validation  of  a 
theory  (or  model)  is  perhaps  a  misnomer,  as  one  can  never  prove  that  a  model  wiU  be  able  to 
accurately  predict  the  future.  However,  new  data  can  falsify  a  model,  i.e.,  the  model  may  prove  to 
be  incapable  of  producing  the  new  data.  Hence,  a  more  illuminating  term  is  unfalsification.  Thus, 
if  a  data  record  unfalsifies  a  model,  that  model  is  good  enough  to  explain  the  current  data,  but 
might  be  falsified  by  a  new  data  record. 

Clearly  model  unfalsification  is  a  feasibility  problem  -  find  a  member  of  the  model  set  that 
is  consistent  with  the  data.  But  many  postulated  models  could  have  produced  the  data.  So 
what  makes  one  validated  model  more  special  than  any  other?  Identification,  it  seems,  poses  an 
optimization  problem:  pick  the  model  that  has  the  smallest  noise  variance.  Why?  It’s  not  the  only 
unfalsified  model. 

In  this  past  year  we  have  taken  a  step  towards  answering  this  question  by  establishing  a  new 
system  identification  paradigm  which  is  truly  compatible  with  robust  control  design.  Inspired  by 
the  work  of  PooUa  et  a/.[ll],  we  have  shown  that  unfalsification  of  the  standard  robust  control 
design  uncertainty  model  is  the  natural  replacement  for  system  identification  when  the  intended 
use  of  the  model  is  robust  control  [5].  For  the  ARX  model  (or  any  affine  prediction  error  model), 
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the  unfalsification  step  requires  solving  a  set  of  convex  programming  problems  specifically  LMI 
problems,  of  which  ordinary  least-squares  is  one  member.  The  result  is  an  uncertainty  tradeoff 
curve  between  model  uncertainty  and  disturbance  uncertainty.  Hence,  a  family  of  models  are 
unfalsified  from  the  data  record. 

Since  all  uncertainty  models  along  the  tradeoff  curve  are  equally  unfalsified,  or  valid,  we  are 
led  to  a  natural  method  of  model  selection  via  closed-loop  implementation  and  design  iteration. 
Specifically,  to  each  uncertainty  model  on  the  tradeoff  curve  a  robust  controller  is  designed  and 
implemented  on  the  actual  system.  The  process  starts  with  a  cautious  controller  based  on  a  large 
value  of  the  dynamic  uncertainty  bound.  The  bound  is  gradually  decreased,  and  the  controller 
made  more  aggressive.  Actual  closed-loop  data  is  used  to  evaluate  (unfalsify  or  falsify)  the  con¬ 
troller.  The  process  continues  as  long  as  the  actual  performance  is  better  than  that  predicted  by 
the  corresponding  uncertainty  model  and  worse  than  that  desired.  If  the  desired  performance  is 
achieved,  then  the  design  is  complete.  K  performance  is  worse  than  predicted  but  not  cis  good  as 
desired,  then  the  uncertainty  model  has  been  falsified.  This  new  data  can  then  be  used  in  the  next 
iteration  to  obtain  a  new  tradeoff  curve,  i.e.,  a  new  family  of  unfalsified  uncertainty  models. 

Thus,  in  summary,  two  changes  must  be  made  in  the  system  identification  process:  (1)  postulate 
a  candidate  model  set  which  includes  descriptors  for  both  disturbance  and  dynamic  uncertainty,  and 
(2)  use  the  data  to  produce  unfalsified  models  in  the  candidate  model  set.  There  are  clearly  many 
theoretical  issues  to  be  resolved,  e.g.,  convergence  of  the  iterations,  limitations  of  performance,  etc. 
In  addition,  for  the  iteration  to  be  effective,  reliable  and  rapid  computational  methods  need  to  be 
developed.  Preliminary  results  in  [5]  show  that  the  computations  involved  are  very  similar  to  those 
of  system  identification. 


6.3  Related  and  Supporting  Research 

Replacing  identification  with  unfalsification  has  been  investigated  and  heralded  by  a  number  of 
researchers.  The  uncertainty  tradeoff  curve  discussed  here  was  first  presented  in  [6],  but  at  that 
time  there  was  no  unfalsification  framework  upon  which  to  provide  a  raison  d’etre  for  the  result. 
One  of  the  earliest  attempts  to  formulate  an  unfalsification  problem  for  robust  control  oriented 
uncertainty  models  can  be  found  in  the  dissertation  of  Smith  and  related  papers  [16,  15].  The 
approach  is  based  on  frequency  domain  data  with  a  structured  block  diagonal  uncertainty  model. 
In  [11],  the  model  validation  problem  is  posed  using  time-domain  data  and  both  necessary  and 
sufficient  conditions  are  obtained.  In  the  work  of  Doyle  et  a/.[l],  a  large  class  of  unfalsification  and 
identification  problems  are  cast  in  an  implicit  linear  fractional  framework  which  clearly  shows  which 
problems  are  solvable  and  which  are  hard.  Iterating  and  unfaslifying  in  closed-loop  is  outlined  in 
the  short  essay  by  Dahleh  and  Doyle  [2],  and  expanded  upon  in  the  paper  by  Livestone  et  a/. [8] 
and  dissertation  [9].  Direct  controller  unfalsification  is  proposed  by  Safonov  et  a/.[17,  18,  12]  which 
uses  computational  techniques  similar  to  those  formulated  by  PooUa  et  a/. [11]. 

Other  approaches  have  been  put  forward  which  also  involve  iterating  on  closed-loop  data  but 
use  traditional  identification  rather  than  unfalsification.  These  are  reviewed  in  the  survey  paper  by 
Gevers  [3].  The  fundamental  idea  is  to  make  the  identification  and  control  design  criteria  identical 
by  iteratively  adjusting  data  and  weighting  filters,  e.y.,  [13,  7,  20,  19].  In  a  related  approach, 
referred  to  as  “windsurfer”  adaptation  [7],  the  closed-loop  bandwidth  is  increased  every  iteration, 
if  possible,  and  in  effect,  the  maximum  closed-loop  bandwidth  is  achieved.  Many  of  these  ideas  can 
be  hopefully  made  more  precise  by  replacing  the  identification  step  with  unfalsification. 
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Statistical  Analysis  of  Least-Squares  Identification  for  Robust  Control 
Design:  Output  Error  Case  With  Affine  Parametrization 


Robert  L.  Kosut* 
Integrated  Systems,  Inc. 
3260  Jay  St. 

Santa  Clara,  CA  95054,  U.S.A. 


Abstract  Precise,  finite-data  statistical  propereties  Me  de- 
termined  using  a  least-squares  estimator  based  on  an  output 
error  model  with  an  affine  parameter  representation  where  the 
true  system  is  of  output  error  form,  but  is  not  in  the  model  set. 
The  purpose  of  the  analysis  is  to  show  the  effect  of  unmod¬ 
eled  dynamics  on  the  resulting  closed-loop  system  designed 
on  the  basis  of  the  estimated  transfer  function.  This  simple 
problem  set-up  is  prototypical  of  the  interplay  between  system 
identification  and  robust  control  design. 

Introduction 

The  problem  addressed  is  the  following:  given  a  finite  col¬ 
lection  of  sensed  sampled  input/output  data  from  an  unknown 
system,  what  level  of  confidence  can  be  assigned  to  a  feedback 
controller  design  or  modification. 

To  make  the  problem  both  representative  and  analytically 
tractable,  the  following  a  priori  qualitative  data  is  assumed: 

(al)  The  system  which  is  generating  the  data  is  a  discrete 
linear-time-invariant  system  in  output  error  form,  i.e., 

yt  =  (Gu)t  +  et  (1) 

where  t  is  the  sampling  time,  u  and  y  are  the  sensed 
input  and  output  sequences,  respectively,  and  e  is  an  un¬ 
predictable  output  disturbance.  The  operator  G  is  linear¬ 
time-invariant  with  unknown  transfer  function  G{z)  and 
corresponding  impulse  response  sequence  g.  Thus, 

oo 

{Gu)t  =^^gkUt-k  (2) 

k=l 

(a2)  G'(z)  is  stable,  i.e.,  all  the  poles  of  G{z)  are  strictly  inside 
the  unit  circle.  Hence,  there  exist  positive  constants  M  > 
1  and  p  <  1  such  that 

Isffcl  <  A/p*-',  Vjt  >  1  (3) 
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(a3)  The  unpredictable  sequence  e  is  zero-mean  gaussian  i.i.d. 
with  unknown  variance  A<.. 

(a4)  The  input  sequence  u  is  deterministic,  hence,  indepen¬ 
dent  of  e. 

It  is  important  to  emphasize  that  none  of  the  parameters  that 
appear  in  the  above  assumptions  are  assumed  to  be  known; 
they  are  only  known  to  exist.  Hence,  there  is  no  quantitative 
a  priori  knowledge  about  Af,  p,  or  Ac. 

The  above  qualitative  assumptions  do,  however,  impose 
varying  degrees  of  restrictiveness.  Assumption  (al)  imposes 
an  LTI  structure,  which  by  itself  is  not  necessarily  restrictive, 
however,  the  output  error  form  is  very  specific.  This  latter 
restriction,  together  with  the  gaussian  assumption  (a3)  makes 
the  statistical  analysis  easier  without  resorting  to  a  central 
limit  theorem  or  a  law  of  large  numbers.  Assumption  (a4)  im¬ 
plies  that  the  system  is  operating  in  open-loop,  for  otherwise 
u  would  have  a  component  which  is  correlated  with  e. 

For  control  design  it  is  desireable  to  obtain  an  estimate 
of  G{z).  It  is  standard  practice  to  form  a  parametric  model 
G(zy6)  and  estimate  the  free  parameter  9.  Although  many 
parametric  forms  are  possible,  e.p.,  [4],  for  ease  of  analysis  we 
choose  the  following  affine  FIR  paramtrization: 

n 

G{z,d)  =  Y,e,z-'‘  (4) 

Jt=l 

Thus,  the  problem  is  to  estimate  the  first  n  impulse  response 
coefficients  Although  we  specialize  to  the  FIR 

modeling  case,  all  the  results  apply  mutadis  mutandis  to  any 
other  affine  model  of  G{z)y  e.p.,  Laguerre  or  Kautz  models  as 
described  in  [5].  The  essence  of  the  problem  addressed  here 
is,  in  our  opinion,  the  motivation  for  the  work  described  in 
the  recent  special  issue  [6]  on  system  identification  for  robust 
control  design.  In  comparison  with  [2],  the  smoothness  pa¬ 
rameters  My  p  are  not  estimated  by  modeling  the  tail  of  the 
impulse  response  {jfn+i ,  ^n+2,  •  ♦ as  a  random  variable.  Our 
attempt  here  is  to  precisely  determine  the  effect  of  the  un¬ 
modeled  dynamics,  i.e.,  the  tail  of  the  impulse  response,  on  a 
least-squares  parameter  estimator,  without  any  further  prior 
assumptions. 


Least-Squares  Estimation 


In  this  section  we  use  least-squares  on  the  measured  data  to 
estimate  the  first  n  impulse  response  coefficients  52,  • . .}  in 
(2).  Towards  this  end,  the  unknown  impulse  response  param¬ 
eters  , . . . ,  }  are  partitioned  into  the  (finite)  parameter 

vector  to  be  estimated, 


which  consists  of  the  first  n  impulse  response  coefficients,  and 
the  (infinite)  parameter  vector 

r  i?n+l 

13  =  ^  IROO  .g. 


which  is  the  remainder  of  the  impulse  response.  These  param¬ 
eters  -  the  “tail”  of  the  impulse  response,  {fi^n+i,5fn+2, . .  •} 
-  can  significantly  bias  the  estimate  of  the  “head,”  namely, 
{5^1,  • .  • ,  i?n).  Statisticians  refer  to  /?  as  a  “nuisance”  parame¬ 
ter.  Note  that  because  G  is  stable,  ||/?||  is  not  only  finite,  but 
decreases  exponentially  as  n  increases.  That  is,  using  (3), 

=  E  ^  (T) 

*=n+l  ^ 

Using  the  definition  of  a  and  /?  together  with  (l)  gives, 

Yr=:Xa-\-X/3  +  E  (8) 


€IR^, 


X  = 


When  ^  =  0,  it  is  well  known  that  a  and  Ac  are  the  maximnn 
likeihood  estimates  of  a  and  Ac,  respectively,  e.y.,  [l].  In  on 
case,  ^  ^  0,  and  its  effect  on  the  estimates  is  the  subject 
the  next  section. 


Statistical  Analysis 

In  this  section  we  ajialyze  the  effect  of  the  nuisance  pb- 
rameter  ^  on  the  estimates  a  and  Ac  of  a  and  Ac,  respec¬ 
tively.  We  use  the  standard  notation  A/'(/i,E)  to  denote  l 
gaussian  distribution  with  mean  fx  and  variance  E.  LLke- 
'vise,  x^(”^)  denotes  a  chi-squared  distribution  with  m  de¬ 
grees  of  freedom.  Recall  that  if  ^  6  IR"'  is  drawn  fron 
^/{0,R)  with  R  non-singular,  then  qR'^^q  G  V^(^)- 
also  use  (”^,  ^)  to  denote  a  non-central  chi-squared  distr- 
bution  with  m  degrees  of  freedom  and  non-centraJity  parame¬ 
ter  r.  To  fix  the  definition  of  the  non-centrality  parameter,  if 
q  G  IR”"  is  drawn  from  AT (^,  R),  then  q'R~^q  G  (m,  r)  witi 
r  =  From  [3],  we  also  use:  as  either^  tti  or  r  — -x 

(m,  r)  Ar(m  4-  r,  2  (m  -b  2r)).  Hence,  (m,  0)  =  x^(m 
and  as  m  00,  x^(rn)  —  A/'(m,  2m). 

It  is  convenient  to  define  the  “covariance”  matrices,^ 

En  =  (14 

Ei2  =  (li 

S22  =  ix'x  (]•; 

Observe  that  only  En  can  be  formed  from  the  data  and  bx 
assumption  is  invertible. 

The  following  theorem  describes  the  distributions  of  the  kex 
random  variables. 

Theorem  1  Define  the  parameter  error, 


and  the  output  error, 


Of  =  Of  —  Of 


E^Y^XS 


L  UN^n^l  ttAr-n-2  J 

Assuming  that  -Y  G  IR*^^”^  is  non-singular,  *.e.,  u  is  persis¬ 
tently  exciting  of  order  n,  the  least-squares  estimate  of  a  is 
given  by  the  well  known  formula: 


=  arg  \\Y  ^  XBf  =  (X'X)-'^X'Y  (12) 


where  |  ^  =  1  :  n  }  can  be  thought  of  as  estimates  of 

{gk  I  A:  =  1  :  n).  We  also  take  the  estimate  of  Ac,  the  out¬ 
put  error  variance,  as  the  sample-variance, 

A.  =  i||y-A'Sf  (13) 


Under  assumptions  (al)-(a4), 

(i)  The  parameter  error  a  and  the  residual  E  are  indeper^ 
dent  and  normally  distributed  as  follows: 

a  €  Ar(Er,‘E:2^,  ^Ef/)  (K- 

E  €  A.-r)  (2C 

where  F  6  given  by, 

T  =  Iu-XiX'Xy^X'  (21. 

has  rank  A  -  n  and  is  idempotent,  i.e.,  F  =  F^. 

It  can  be  shown  that  this  result  is  also  true  if  both  m  or  r  — oc 
^Although  the  matrices  '^12^^27  infinite  dimensional,  the^ 
always  appear  multiplying  /?.  Hence,  these  terms  are  bounded  be¬ 
cause  the  elements  in  /?  decay  exponentially. 


(it)  and  -^a'Eiia  have  the  following  non-central  chi- 


€ 

squared  distributions: 


€  X^(n-  n,  £«)  (22) 

€  x"  (n,  (23) 


^aEna  €  X  [n,  j-y) 


7  =  ;9'E;2Eri’E,2|9 


«  =  /3'E22/?  -  7  =  iS'  (E22  -  E;2Eri‘Ei2)  fi  (25) 

(Hi)  As  N  —>■  00, 

Ae  ^  J^(^X,  +  S,  ^(A.  +  25))  (26) 


(nXe 

2A« 

/nAe  „  \X 

N 

AAc 

A« 

€  x"(iV-n,  £A„||;3f) 

(31) 

G  A"(n) 

(32) 

!ton,  as  N  oo, 

— ►  Af  ^Ac  +  Ati 

ll^ir-  ^(A.  +  2A„||/?in) 

(33) 

The  asymptotic  part  of  the  above  theorem  leads  to  the  follow¬ 
ing: 

Approximation  2  For  sufficiently  large  N,ifu  is  white,  i.e., 
(30)  holds,  then  with  high  probability: 


Q'^SiiQ'  — ►  Af 


The  results  in  part(i)  follow  directly  from  the  underlying  as¬ 
sumptions  and  definitions  of  the  variables,  and  except  for  the 
non-zero  bias  terms,  are  standard,  e,g.,  [l].  Part  (ii)  is  non¬ 
standard,  in  that  the  error  statistics  involve  non-central  chi- 
square  distributions.  These  results  are  obtained  by  direct  ap¬ 
peal  to  the  relation  between  a  normzJly  distributed  random 
variable  and  the  non-centrcd  chi-squred  statistic  as  stated  in 
the  introduction  to  this  section.  The  asymptotic  results  in 
part(iii)  follow  from  the  asymptotic  normal  approximation  to 
a  non-central  chisquare  distribution  as  stated  in  the  introduc¬ 
tion  to  this  section. 

In  part  (iii)  of  the  theorem,  the  asymptotic  variances  decay 
as  l/7\r.  Hence,  for  sufficiently  large  N,  the  random  variable 
approaches  the  mean  with  high  probability.  This  leads  directly 
to  the  following: 

Approximation  1  For  sufficiently  large  N ,  the  following  ap¬ 
proximations  hold  with  high  probability, 

Ac  Ae  +  5  (28) 

o'EiiO?  « 

Observe  that  for  large  N,  the  variance  estimate  Ac  tends  to 
over-^timate  the  true  variance  Ac.  In  addition,  the  errors  a 
and  Ac  —  Ac  are  driven  by  the  “nuisence”  parameter  p,  i.e., 
the  tail  of  the  impulse  response. 

A  special  case  of  interest  is  when  the  input  u  is  white,  i.e., 

Sll  =  Au  •  /ni  S12  =  0,  E22  =  Au  '  loo 

Theorem  2  If  u  is  white,  i.e,,  (30)  holds,  then: 


Ae  a  A.  +  A„W  (34) 

11“"'  ^  'ft 

Large  N  and  High  Probability 

When  the  input  is  white,  “large  N”  can  be  taken  as, 

where  t/  is  the  ratio  of  the  energy  in  the  tail  to  the  output 
error  energy.  Typical  values  of  N,  e.g.,  500-1000,  will  always 
be  well  in  excess  of  variations  caused  by  t/.  Moreover,  from 
central  and  non-central  chi-square  tables  (e.g.,  [3]),  values  of 
A  >  100  and  n  >  20  make  the  normal  approximations  very 
accurate.  In  consequence,  “high  probability”  is  in  excess  of 
99.95%  for  typical  data  lengths  and  model  orders.  Similar 
numbers  hold  for  the  general  case  with  a  non-white  input. 

Frequency  Response  Estimation 

The  results  of  the  previous  section  can  be  used  to  analyze 
the  errors  in  frequency  response  estimation.  Towards  this  end, 
express  G(z),  the  true  transfer  function  as, 

G(z)  =  DizYa  +  D{zy/3  (37) 


D{z)^ 


D{z)  = 


Let  G(z)  denote  the  transfer  function  estimate  of  G(z)  defined 
as 

G(z)  =  D(zys  (39) 

where  a  is  the  least-squares  parameter  estimate  from  (12)  of 
the  the  first  n  impulse  response  coefficients  of  G(z).  Let  A(2) 
denote  the  transfer  function  error  defined  as, 

A(z)  =  G(z)-d(z)  (40) 

=  -D{zyS  +  D{zyfi  (41) 


D{zya  =  ^^(gk  -  gk)z  D(zy/}  =  ^  gkZ  *  (42) 

fc=l  >:=n+l 

with  Q  the  parameter  error  from  (17). 

From  Theorem  1  the  following  result  is  obtained. 
Theorem  3  The  following  results  hold  at  each  frequency  w : 


IS 


(i)  Normal  distribution 

^D(en''^-^D(en)  (43) 

where 

fizY  ^  D{zy -- Dizy^ZiE^2  (44) 

(ii)  Non-central  chi-squared  distribution 


(Hi)  Asymptotic  Normality 
j4a  N  — ►  oo, 

|A(e>“)P 

2(1  + 2e(a))))  (52) 

\N  X,) 

Part  (iii)  together  with  Approximation  2  leads  to: 


|A(e>“)p 

^i)(eJ“)*E“^i)(eJ")  (1>«(‘")) 

with  non- centrality  parameter^ 

e(a,)  =  - 


Approximation  4  If  u  is  white,  i.e.,  (SO)  holds,  then  for 
(45)  sufficiently  large  N,  the  following  approximation  holds  with 
high  probability  at  each  frequency  w : 


(46) 


|A(enP«^^  +  |5(e>W 


(53) 


(Hi)  Asymptotic  Normality 
As  N  00, 

|A(6^-)P 

^Z?(e>")*Eri‘Z?(e>“) 


A^(l+e(u>),  2(l+2e(w))) 
(47) 


Part  (iii)  leads  to  the  following  result. 


Robust  Control  Analysis 

In  this  section,  we  use  the  asymptotic  frequency  domain 
bounds  to  evaluate  controUer  robustness.  The  goal  of  control 
b  to  reduce  the  output  variance.  Consider  the  LTI  feedback 
controller 

«  =  -Ky  (54) 

where  K  stabilizes  the  “estimated”  FIR  system 


Approximation  3  For  sufficiently  large  N ,  the  following  ap- 
proximation  holds  with  high  probability  at  each  frequency  01: 

I  A(e^“)|^  «  +  |F(e^“')'^p  (48) 

Observe  that  if  u  is  white  (30)  then 

f3(e-'“')*E-/D(e-’“')  =  £)(e-’“')*  (p/n)  f)(e^“') 

=  ^Z?(e^“')*f?(e^“')  =  — 

Au 

This  leads  to  the  following: 

Theorem  4  If  u  is  white,  i.e.,  (30)  holds,  then  at  each  fre- 
quency  to: 


y  — Gu  +  e,  G(^)  =  ^5*^  *  (55) 

k=l 

Applying  the  control  (54)  to  the  actual  system  (1)  yields  the 
closed-loop  system 


y 


where 

f 


T  Q 

- u  = - 

1  +  QA  1  +  Q  A 

1+GK  1+GK 


(56) 


(57) 


with  A  the  estimation  error  as  defined  in  (40).  Since  the 
nominal  system  b  stable,  it  foUows  that  A,  f ,  and  Q  are  stable 
transfer  functions.  Hence,  the  closed-loop  system  is  stable  if 
and  only  if, 


(i)  Normal  distribution 


A(c^“’)  €  A^  (5(e^“)V, 

(ii)  Non-central  chi-squared  distribution 


|A(e>“')p 

(JL 

X,} 


(l,e(a;)) 


with  non-centrality  parameter 


f(w) 


(#t) 


|l  +  Q(e-'“')A(eni>0,  VH<x 


(58) 


(49)  liolds,  then  the  spectrum  of  y,  under  closed-loop  -not 

during  identification-  is  given  by: 


#,(a>) 


(50) 


(51) 


f(e>“’) 


1  +  <3(e-'“)A(e>“') 


Ae 


(59) 


Suppose  that  u,  during  identification,  is  white,  i.e.,  (30) 
holds.  To  establbh  stability,  observe  that  a  sufficient  condition 
for  stability  is  that, 


It 


|Q(e^‘^)|-|A(eni<l,  VM<x 


(60) 


Using  the  expression  for  |A(e-'*^)|  in  Approximation  4  and  sub¬ 
stituting  for  Ae  from  (34).  it  follows  that  for  large  iV,  the 
closed-loop  system  is  stable,  with  high  probability,  if, 


\Q{en\" 


3n  I  Ac 

'n  I  aT 


^  -  ll^ir  )  + 


<  1,  V(tj|  <  TT 
(61) 

Hence,  using  the  large  N  approximations,  with  high  probabil¬ 
ity,  the  output  spectrum  is  bounded  as  follows: 


|f(e^“)P  (a.  -  A.  mf  ) 

$y(w)  <  - - —A - i - 

(l  -  |Q(e7«)|  (1^  -  ||;3||^)  +  |5(e^-*')'^p] j 

The  only  unknown  quantity  is  /?.  From  (34),  we  also  know 
with  high  probability  that, 

Ae  «  Ac  —  Xu  \\P\\^ 

Since  Ac  must  be  positive,  it  follows  that 

Pll"  <  Ae/Au  (63) 

provides  a  worst-c^e  upper  bound.  Observe  that  this  bound 
is  known  because  Ac  is  the  computed  variance  estimate  and 
Xu  is  selected  by  the  user  as  the  input  variance.  As  a  practical 
matter,  it  is  unlikely  that  ^  will  achieve  this  bound.  If  it  did, 
then  the  noise  variance  Ac  0,  which  for  large  N,  will  almost 
never  occur. 

Using  (3),  we  get 


oo 

\D{cn'p\  =  I  Y.  ^ 

fc=n+2  ^ 

Hence,  for  large  iV,  the  closed-loop  system  is  stable  with  high 
probability  if, 


3nAc 

AT  A.  (1  -  pY 


<  1,  V|a;|  <  X 


(64) 


The  constants  M  and  p  are  unknown,  so  in  order  to  evalu¬ 
ate  the  above  robustness  condition,  either  we  require  a  priori 
knowledge  or  infer  the  values  from  the  first  n  impulse  response 
coefficients  =  [Ji  •  "yn].  That  is,  define  the  estimates  Af,p 
via 

\gk\  <  V/:  €  [l,n]  (65) 

and  replace  M,  p  with  Af ,  p.  This  leads  to  the  robustness  test: 


'SnAc 

.  iv  A„  (1  -  ^2 


<  1,  Vlu;|  <  X 


(66) 


Now,  suppose  that  the  closed-loop  system  is  stable  and  the 
above  inequality  holds.  Then  the  spectrum  of  y  is  bounded, 
with  high  probability,  by:  of  y  and  u  are  given,  respectively, 
by: 


^y(^)  < 


\T{e 


Ac 


(- 


n 

N  A«  (j 


-P)^  J 


The  above  bound  gives  an  indication  of  the  trade  between 
bias  and  variance  as  the  model  order  varies  -  all  results  being 
valid  for  data  length  N  >  500  with  probability  in  excess  of 
99.95 


Concluding  Remarks 


Using  an  output  error  linear  plant,  we  have  shown  that 
with  gaussian  noise  and  affine  models,  there  is  a  very  rich 
structure  in  the  analysis  of  standard  least-squares  estimation 
of  the  first  n  impulse  response  coefficients.  The  remaining 
coefficients  bias  the  estimate  in  a  precisely  defined  way  in¬ 
volving  non-central  chi-squared  statistics.  These  appear  to  be 
extremely  useful  in  predicting  model  error  for  robust  control 
design  from  finite  data  records.  Much  still  remains  to  be  done 
even  for  this  restricted  and  analytically  tractable  case,  partic¬ 
ularly  in  finding  a  means  to  bound  the  effect  of  the  bias  (the 
tail  of  the  impuke  response)  without  having  to  perform  addi¬ 
tional  identification  with  ever  larger  parameter  orders.  This 
ultimately  may  involve  additional  a  priori  quantitative  knowl- 
wdge.  We  feel  that  this  paper  indicates  a  first  step  towards 
the  more  difficult  problem  of  model  structures  which  account 
for  non-white  noise,  e.y.,  ARX  or  ARMAX  models. 
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SUMMARY 

A  new  approach  is  given  for  the  design  of  adaptive  robust  control  in  the  frequency  domain.  Starting  with 
an  initial  model  of  a  stable  plant  and  a  robust  stabilizing  controller,  the  new  (windsurfer)  approach  allows 
the  bandwidth  of  the  closed-loop  system  to  be  increased  progressively  through  an  iterative  control¬ 
relevant  system  identification  and  control  design  procedure.  The  method  deals  with  both  undermodelling 
and  measurement  noise  issues.  Encouraging  results  are  obtained  in  the  simulations  that  illustrate  the  new 
idea. 

KEY  WORDS  Adaptive  control  Robust  control  Internal  model  control  System  identification 


1.  INTRODUCTION 

It  has  long  been  understood  that  a  key  problem  in  control  system  design  is  to  handle  the 
uncertainties  associated  with  the  plant.  *  Two  main  techniques  for  the  analysis  and  design  of 
systems  with  significant  uncertainties  are  adaptive  control^  and  robust  control. 

In  the  traditional  approach  to  analysis  and  design  of  an  adaptive  control  system^  it  is 
assumed  that  the  unknown  plant  can  be  represented  by  a  model  in  which  everything  is  known 
except  for  the  values  of  a  finite  number  of  parameters.  Once  the  parameters  are  estimated  (and 
even  during  the  estimation  process),  the  principle  of  certainty  equivalence  is  normally  invoked 
to  update  the  controller.  Normally  the  unstructured  uncertainties  of  the  model  are  ignored  in 
this  approach.  Therefore  it  is  not  surprising,  as  pointed  out  in  Reference  5  that  these  adaptive 
controllers  are  often  not  robust.  Further,  the  extensions  of  the  traditional  approach  to  adaptive 
control  which  purportedly  cope  with  unstructured  (and  other)  uncertainties  involve  conditions 
which  are  often  hard  to  apply  or  to  grasp  intuitively  (see  e.g.  References  6-8).  A  further 
problem  with  the  traditional  approach  is  that  extreme  transient  excursions  are  possible  even 
when  global  convergence  and  asymptotic  performance  are  guaranteed.® 

To  be  more  specific,  we  consider  an  adaptive  control  system  as  shown  in  Figure  1,  where  G 
is  the  unknown  transfer  function  of  the  plant.  The  time  axis  is  divided  into  intervals  such  that 
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Figure  1.  Adaptive  control  system 


during  the  fth  interval  the  control  input  applied  to  the  plant  is  obtained  from  Ki,  where  Ki  is 
the  transfer  function  of  the  controller  designed  using  the  model  G,  ~i  obtained  at  the  end  of 
the  (/  -  l)th  time  interval. 

In  an  adaptive  control  problem  the  ulterior  objective  for  finding  G,*,  an  estimate  of  G 
updated  from  G,-i,  is  to  design  a  controller  A’i+i  better  than  Ki  such  that  certain  control 
objectives  are  improved.  For  example,  if  7^  represents  the  desired  complementary  sensitivity 
function,  then  we  may  like  to  have 


GKi  ^ 

GKi-i  ^ 

1  +  GKi 

«o 

l  +  GKi-j 

Implicitly,  this  means  we  would  like  to  minimize 

GKi  ^ 

1  +  GKi 

Since  G,  the  transfer  function  of  the  plant,  is  unknown,  we  could  only  base  our  design  of  Ki 
on  G/-1  such  that 


Ki  =  arg  min 

7 


Gi-\y 
1  +  Gi-i7 


V/ 


Note  that,  as  usual,  we  have  invoked  the  principle  of  certainty  equivalence.  However,  it  is 
important  to  realize  that 


GKi  ^ 
1  +  GKi 


is  not  necessarily  small  even  though 


Gi-xKi 

\+Gi-iKi 


-  7^ 


is  minimum.  This  partly  explains  why  traditional  adaptive  control  systems,  which  almost 
invariably  invoke  the  principle,  of  certainty  equivalence,  often  have  unsatisfactory  robustness 
properties. 

In  the  robust  control  approach  a  controller  is  designed  on  the  basis  of  a  nominal  model 
of  the  plant  with  the  associated  parametric  and  unstructured  model  uncertainties  explicitly 
taken  into  account.  Therefore  stability  robustness  is  guaranteed  and  performance  robustness 
is  achieved  sometimes.  The  weakness  of  this  approach  is  that  it  considers  only  the  a  priori 
information  on  the  model  and  neglects  the  fact  that  some  characteristics  of  the  plant  could  be 
learnt  while  it  is  being  controlled.  Therefore  the  robust  control  approach  tends  to  result  in  a 
conservative  design  in  terms  of  performance.  It  is  likely  that  a  posteriori  knowledge  about  the 
plant  could  be  used  to  reduce  the  conservatism  in  a  robust  control  design. 
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In  this  paper  we  present  a  new  approach  for  the  design  of  adaptive  robust  control  for  a 
stable  plant  and  the  related  control-relevant  system  identification.  A  preliminary  study  of  the 
problem  under  noiseless  conditions  is  reported  in  Reference  10.  The  main  differences  of  this 
paper  from  Reference  10  are  that,  instead  of  updating  the  models  through  finding  rational 
function  approximations  (in  the  sense)  of  certain  plant  parametrization  and  employing 
proper  but  non-strictly  proper  controllers,  here  we  perform  system  identification  using 
input-output  data  and  employ  strictly  proper  controllers.  It  was  also  shown  in  References  1 1 
and  12  that  an  iterative  approach  for  model  refinement  and  control  robustness  enhancement 
can  be  developed  in  the  context  of  an  Hz  control  problem. 

In  Section  2  we  describe  the  windsurfer  approach  to  adaptive  control.  In  Section  3  we 
apply  the  windsurfer  approach  to  an  adaptive  model-matching  problem  and  formulate  the 
related  control-relevant  system  identification  criterion.  The  relevance  of  the  system 
identification  criterion  to  the  adaptive  model-matching  problem  is  explored  further  in 
Section 4.  In  Sections  we  apply  Hansen’s  approach to  recast  the  closed-loop  system 
identification  problem  into  an  open-loop  system  identification  problem.  We  also  show  that 
with  appropriate  filtering,  the  criterion  used  in  the  open-loop  system  identification  is  highly 
relevant  to  the  windsurfer  approach  to  adaptive  model  matching.  In  Section  6  we  develop  the 
relation  between  the  approximate  identification  and  the  internal  model  control  (IMC)'*  method 
of  controller  design.  We  present  the  simulation  results  in  Section  7.  In  Section  8  we  conclude 
the  paper  and  attempt  to  give  some  reasons  for  the  success  of  the  method. 

2.  THE  WINDSURFER  APPROACH  TO  ADAPTIVE  CONTROL 

By  considering  how  humans  learn  windsurfing,  Anderson  and  Kosut*^  have  made  the 
following  observations. 

1 .  The  human  first  learns  to  control  over  a  limited  bandwidth  and  learning  pushes  out  the 
bandwidth  over  which  an  accurate  model  of  the  plant  is  known. 

2.  The  human  first  implements  a  low-gain  controller  and  learning  allows  the  loop  to  be 
tightened. 

On  the  basis  of  these  observations,  an  adaptive  robust  control  design  philosophy,  the 
windsurfer  approach^  is  proposed  in  Reference  13.  It  recognizes  that  the  plant  characteristics 
can  differ  greatly  from  the  estimated  model  at  any  one  time,  particularly  during  the  initial 
learning  stage.  In  the  new  design  approach  a  low-gain  controller  will  first  be  implemented  and 
the  control  bandwidth  will  be  small.  On  the  basis  of  learning  a  frequency  domain  description 
of  the  plant  in  closed  loop,  with  the  learning  process  progressively  increasing  the  bandwidth 
over  which  the  plant  is  accurately  known,  the  controller  gain  can  be  increased  appropriately 
over  an  increasing  frequency  band.  For  details  see  Reference  13.  Importantly,  in  the  method 
suggested,  the  necessary  closed-loop  system  identification  task  is  transformed  into  an  open- 
loop  system  identification  problem  through  the  use  of  coprime  fractional  representations  as 
discussed  in  References  14-16. 

It  was  shown  recently*’  that  the  best  model  for  control  design  cannot  be  derived  from  open- 
loop  experiments  alone.  The  controller  to  be  implemented  should  be  taken  into  account  by  the 
system  identification  experiments.  However,  this  controller  is  not  yet  available,  since  its 
determination  rests  on  the  results  of  the  system  identification  to  be  carried  out.  Hence  a  general 
solution  to  the  combination  of  system  identification  and  control  design  is  necessarily  iterative. 
Although  the  emphasis  of  Reference  17  is  on  the  problem  of  modelling  for  control  purposes, 
its  approach  is  very  similar  to  that  of  Reference  13. 
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In  the  next  section  we  would  like  to  illustrate  the  windsurfer  approach  by  considering  a 
model-matching  problem  in  the  context  of  adaptive  control. 


3.  ADAPTIVE  MODEL  MATCHING 

Let  G  be  the  unknown  transfer  function  of  the  plant  and  let  7^  represent  a  desired 
complementary  sensitivity  function.  We  wish  to  achieve,  through  iterative  system  identification 
and  control  design,  the  minimization  of  the  cost  function 

GK  ^ 

\  +  GK 

where  K  is  the  transfer  function  of  a  controller  to  be  designed. 

We  begin  by  designing  a  controller  A'l.o  to  stabilize  a  known  initial  model  Go,  which  may 
be  obtained  from  an  open-loop  system  identification  exercise.  If  K\^o  also  stabilizes  the 
unknown  transfer  function  G,  then  we  say  that  /Ti.o  robustly  stabilizes  Go.  Notice  that  we  use 
Kjj  to  denote  the  yth  controller  designed  using  the  /th  model  which  has  a  transfer  function  G/. 
In  general,  we  attach  the  subscript  y,/  to  a  transfer  function  to  denote  that  it  is  either  specified 
or  derived  on  the  basis  of  the  /th  model  for  the  plant  at  the  yth  iteration  of  control  design. 
Since  Go  may  involve  significant  uncertainties,  the  resulting  controller  ATi.o  may  not  be  able  to 
achieve  a  small  value  for 


GqKx^q 
1  +  GoA^i^o 


-  7^ 


while  robustly  stabilizing  Go-  In  general,  we  need  to  consider  how  to  handle  the  question  of 
securing  robust  stabilization  of  G,*  by  Kj,i.  This  is  bound  up  with  the  question  of  selection  of 
7^.  It  is  in  fact  to  be  expected  that  a  sequence  of  7^  will  be  selected  in  such  a  way  that  the 
end  control  objective  can  be  approached  in  stages.  We  shall  therefore  proceed  as  follows. 

In  association  with  each  of  the  models  G/,  a  sequence  of  controllers  Kjj  is  designed  such  that 


Kj,i  -  arg  min 

7 


Gi7 
1  +  Gi7 


vy 


(1) 


where  the  sequence  of  functions  Tjj  is  specified  with  7j+i,,*  normally  of  wider  bandwidth  than 
7^,/.  It  is  also  necessary  that  F?./  results  in  a  controller  K\/t  that  robustly  stabilizes  G/.  A  stage 
will  be  reached  (say  when  y  =  N)  where  the  bandwidth  of  the  nominal  closed-loop  transfer 
function, 


F/^,i 


GjKf^j 

1  +  GiKs,i 


(2) 


cannot  be  increased  further  without  causing  the  effects  of  model  uncertainties  in  G,*  to  be  too 
significant.  This  occurs  when  the  value  of 

WTsa-TnAU 


is  no  longer  small,  where 


GKnj 
1  +  GKs.i 


(3) 


is  the  actual  closed-loop  transfer  function  of  the  system. 

At  this  stage  it  is  necessary  to  improve  the  accuracy  of  the  model  in  such  a  way  that  is 
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relevant  to  the  control  objective.  This  means  that  we  should  try  to  find  an  updated  model  Gt+i 
such  that 


Gi+i 


=  arg  min 
e 


GKs.i 

eKs.i 

1  -f  GKH,i 

1  +  dKs,i 

(4) 


Once  G/+1  is  found,  we  can  continue  to  increase  the  closed-loop  bandwidth  by  repeating  the 
procedure  described  for  G,  previously.  However,  G,  +  i  should  be  used  instead  of  G,  and  we 
specify  a  new  sequence  of  functions  Tj,i+i  with  7^,/+i  =  7n.(-  The  iterative  process  is 
continued  until  the  end  control  objective  is  achieved  or  it  is  prematurely  terminated  because 
of  one  or  more  of  the  following  constraints: 


(1)  fundamental  performance  limitations  due  to  right-half-plane  poles  and  zeros  of  the  plant 
and/or  models’® 

(2)  unstable  model  is  obtained  (This  is  a  consequence  of  our  simplified  control  design 
method.  Appropriate  extensions  of  the  control  design  method^  allow  us  to  deal  with  this 
restriction.) 

(3)  finite  control  energy 

(4)  no  further  improvements  in  the  identified  model  can  be  made  for  a  reasonably  large  set 
of  input— output  measurements. 


4.  CONTROL-RELEVANT  SYSTEM  IDENTIFICATION 
It  should  be  noted  that  the  system  identification  criterion  formulated  in  Section  3, 


GKs.i 

OKs.i 

1  4*  GKfsfj 

1  +  6Kf^,i 

would  be  the  formulation  of  a  standard  rational  function  approximation  problem  provided 
that  G  were  known.  However,  as  opposed  to  an  approximation  problem,  we  are  here  dealing 
with  a  system  identification  problem  where  G  is  an  unknown  transfer  function  and  only  a  finite 
number  of  (possibly  noisy)  input-output  measurements  are  available.  Despite  this  apparent 
difference,  we  must  emphasize  that  equation  (5)  is  exactly  the  dual  of  the  criterion  developed 
by  Anderson  and  Liu”  in  the  controller  reduction  problem  based  on  closed-loop  transfer 
function  consideration,  where  their  plant  and  reduced-order  controller  are  replaced  by  our 
controller  and  estimated  model  respectively.  We  can  therefore  draw  a  similar  conclusion  that 
in  our  system  identification  problem  there  is  a  reduced  weighting  placed  on  the  range  of 
frequencies  where  the  loop  gain  is  high.  This  is  very  appealing  since  it  agrees  with  the  well- 
known  fact  that,  for  a  stable  closed-loop  system,  model  errors  are  more  tolerable  in  the  range 
of  frequencies  where  the  loop  gain  is  allowed  to  be  large.  More  importantly,  as  we  shall  explain 
below,  this  system  identification  criterion  will  enable  us  to  find  a  new  model  which  allows  us 
to  design  a  closed-loop  system  with  a  larger  bandwidth  than  what  the  original  model  would 
allow. 

If  we  rewrite  equation  (5)  in  the  form 
G,  + 1  =  arg  min 

we  see  immediately  that  it  is  the  product  of  the  actual  sensitivity  function 

1 


1  +  GKs.i 
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and  the  nominal  complementary-sensitivity-function-weighted  multiplicative  model  error 

I  \  ( G  ~  Gj  -f  i\ 

\1  +  Gj+i^Ar.,/ \  G,  +  i  / 

It  appears  that  the  frequency-weighting  function  in  equation  (6),  which  involves  the  unknown 
actual  sensitivity  function,  cannot  be  implemented  in  the  system  identification  procedure. 
However,  we  shall  show  in  Section  5  that  by  recasting  the  closed-loop  system  identification  into 
an  open-loop  system  identification  problem,  we  also  obtain  a  system  identification  criterion 
which  is  equivalent  to  equation  (6)  but  involves  only  a  known  frequency-weighting  function. 
Therefore,  for  the  purpose  of  understanding  the  effects  of  the  system  identification  criterion 
on  the  identified  model,  we  can  treat  the  frequency-weighting  function  in  equation  (6)  as  a 
known  quantity. 

Recall  that,  as  described  in  Section  3,  we  were  using  the  model  G,  to  design  a  sequence  of 
controllers  Kjj,  with  increasing  gain  over  an  increasing  range  of  frequencies,  such  that  the 
closed-loop  system  has  an  increasing  bandwidth.  At  the  stage  where  j  =  N,  the  gain  of  the 
controller  K^.i  has  become  so  large  that  the  high-frequency  model  uncertainties  associated  with 
Gi  are  no  longer  insignificant.  Any  attempt  to  increase  the  closed-loop  bandwidth  further  will 
cause  the  magnitude  of  the  nominal  complementary-sensitivity- function-weighted 
multiplicative  model  error 

/  GiKN.i  \(G-Gi\ 

\l  +  G//(rK.A  Gi  ) 

to  become  too  large  at  certain  frequencies,  such  that  the  system  may  lose  performance 
robustness  or  even  stability  robustness.  Therefore  the  gain  of  the  controller  Ks.i  will  be  limited 
and  the  actual  sensitivity  function 

1 

1  +  GKsj 

will  be  large  beyond  the  existing  limited  closed-loop  bandwidth.  From  equation  (6)  we  notice 
that  It  is  exactly  in  this  range  of  frequencies,  where  the  actual  sensitivity  function  has  large 
magnitude,  that  our  system  identification  criterion  wilt  penalize  the  nominal  complementary- 
sensitivity-function-weighted  multiplicative  model  uncertainties  of  the  new  model  Gi+i.  We 
could  therefore  expect  G,  +  i  to  have  smaller  model  uncertainties,  as  compared  with  G,,  near 
and  beyond  the  edge  of  the  closed-loop  bandwidth  that  can  be  achieved  with  G,-.  This  will 
allow  us  to  design  controllers  /0.,+i  that  lead  to  larger  closed-loop  bandwidth  than  was 
possible  with  G,.  Hence  we  can  say  that  criterion  (6)  is  control-relevant. 

To  make  concrete  the  above  discussions,  we  consider  an  example  where  the  plant  has  a 
transfer  function 


(7) 

(8) 


(9) 
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Frequency  (rad/s)  Frequency  (rad/s) 

Figure  2.  Frequency- weighted  multiplicative  model  errors 


then  it  can  be  shown  that  the  nominal  closed-loop  transfer  function  is  given  by 


(10) 


Therefore  X  is  the  nominal  -6  dB  bandwidth  of  the  closed-loop  system.  As  X  is  increased  to 
0*5  rads"*,  the  actual  closed-loop  unit-step  response,  zts  shown  in  graph  (a)  of  Figure  2,  has 
excessive  oscillations  because  the  model  uncertainties  associated  with  Go  are  no  longer 
insignificant.  This  is  apparent  in  graph  (b)  of  Figure  2,  which  shows  the  magnitude  of  the 
nominal  complementary-sensitivity-function-weighted  multiplicative  model  error  of  Go.  We 
have  also  shown  the  actual  sensitivity  function  in  graph  (c)  of  Figure  2,  which  indicates  that, 
if  it  is  incorporated  into  the  system  identification  criterion  (for  which  the  procedure  is 
described  later  in  Sections  5  and  6),  the  nominal  complementary-sensitivity-function-weighted 
multiplicative  error  in  the  new  model  Gi  will  be  penalized  in  the  range  of  frequencies  near  and 
beyond  the  existing  closed-loop  bandwidth  of  0*5  rads"*.  It  can  be  seen  from  graph  (d)  of 
Figure  2,  which  shows  the  nominal  complementary-sensitivity-function-weighted  multiplicative 
error  of  the  new  model  Gi,  that  this  is  indeed  the  case.  Therefore  the  new  model  Gi  will  allow 
us  to  increase  the  closed-loop  bandwidth  beyond  0-5  rad  s"*. 


5.  CLOSED-LOOP  SYSTEM  IDENTIFICATION 

We  first  review  a  method  for  closed-loop  system  identification  developed  by  Hansen  and  co¬ 
workers.  Subsequently,  in  Theorem  2,  we  demonstrate  that  with  appropriate  signal 
filtering,  Hansen’s  method  provides  a  suitable  framework  to  carry  out  the  control-relevant 
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system  identification  formulated  in  Section  3.  For  the  sake  of  expository  simplicity  we  shall 
consider  only  scalar  plants.  We  begin  with  the  following  theorem.^® 


Theorem  I 

U  K  =  XjY  is  a  controller,  where  X  and  Y  are  stable  proper  transfer  functions,  and  if  N 
and  D  are  stable  proper  transfer  functions  that  satisfy  the  Bezout  identity 

NX+DY= 1 


then  the  set  of  all  plants  stabilized  by  the  controller  K  is  precisely  the  set  of  elements  in 


=  /?  is  a  stable  proper  transfer  function] 


Consider  the  feedback  system  shown  in  Figure  3,  where  y  and  u  are  the  measured  output 
and  the  control  input  respectively,  e  is  an  unpredictable  white  disturbance  and  rj  and  r2  are 
user-applied  inputs.  It  is  assumed  that  Kjj  is  a  known  stabilizing  controller,  G  is  inexactly 
known  and  possibly  unstable  and,  as  is  standard,^’  H  is  imperfectly  known,  stable  and 
inversely  stable.  The  system  identification  problem  is  to  obtain  improved  estimates  of  G  and 
H  from  a  finite  interval  of  measured  and  known  data  {y,  u,ri,r2:  0  ^  <  T). 

Following  Hansen,  **  we  introduce  the  stable  proper  transfer  functions  Xj,i,  Yjj,  Nj,i  and 
Dj,i  which  satisfy 


Kj.i  = 


Yj/ 


NiXj.i  +  DiYj.i=\ 


The  interpretation  is  that  G,  is  a  known  but  imperfect  model  of  the  plant  which  is  also 
stabilized  by  Kj,i.  Applying  Theorem  1  as  shown  in  References  15  and  16,  there  exist  stable 
proper  transfer  functions  Rj,i  and  Sj,-,,  with  Sj,i  also  inversely  stable,  such  that 


Ni  +  RjjYj.i 
Di  ~  RjjXjj 


H  = 


Sj.i 


Di  -  Rj.iXj.i 


(11) 

(12) 


Figure  3.  Closed-loop  system 
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Figure  4.  Closed-loop  system  identification 


where  Rjj  denotes  the  parametrization  of  G  using  the  iih  model  and  its  associated  yth 
controller  Kjj. 

As  a  result,  system  identification  of  G  and  H  in  closed  loop  is  equivalent  to  system 
identification  of  the  stable  proper  transfer  functions  Rjj  and  Sjj.  Using  equations  (11)  and 
(12),  we  can  represent  the  feedback  system  as  shown  in  Figure  4. 

From  Figure  4  we  can  write 

0  =  Rjja  +  Sj,ie  (13) 

where 


However,  since 


and 


equation  (14)  can  be  rewritten  as 


a=  Xjjy+  Yjju 
(3  =  Diy-NiU 


u  =  Kjj(ri  -  y)-^  ri 


a  =  Xjjri  +  Yjjri 


(14) 

(15) 


It  is  important  to  observe  from  equations  (13),  (15)  and  (16)  that  a  depends  only  on  the 
applied  signals  n  and  ri  operated  on  by  known  stable  proper  transfer  functions  Xjj  and 
Yj,i  respectively,  while  /3  depends  on  the  measured  signals  y  and  u  operated  by  known  stable 
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proper  transfer  functions  £>/  and  Ni  respectively.  Moreover,  a  is  independent  of  the  transfer 
functions  G  and  //and  the  disturbance  e.  Hence  the  system  identification  of  G  and  //in  closed 
loop  has  been  recast  into  the  system  identification  of  Rjj  and  Sjj  in  open  loop. 

We  shall  next  state  a  result  which  is  highly  relevant  to  the  system  identification  step  of  the 
windsurfer  approach  to  adaptive  control. 


Theorem  2 

Let  the  controller  Kjj  stabilize  the  plant  G  and  the  model 

Ni 


Gi  = 


Di 


where  Ni  and  Di  are  stable  proper  transfer  functions,  and  let 

^  j.i 

where  Xjj  and  Yjj  are  stable  proper  transfer  functions  satisfying  the  Bezout  identity 

NiXj,i^DiYjj^\ 

Let  Gi  +  i  be  another  model  of  G,  also  stabilized  by  Kjj  and  therefore  having  a  description 


^  _Ni^-  rj,iYjj 


(17) 


where  rjj  is  a  stable  proper  transfer  function.  Also  define  the  filtered  output  error 

YjA0  -  rj,ia)  (18) 

where,  with  rz  =  0, 

a  =  Xjjru  P  =  Diy  -  NiU 

ri  =  reference  signal,  y  =  plant  output,  u  =  control  input 

Thus  f  is  an  error  arising  in  the  (open-loop)  identification  of  Rj,i  through  an  estimate  ry.i.  Then 
the  filtered  output  error  can  be  expressed  as 


=  /  GKj,i  Gi.iKj,i  \ 

\l^GKj,i  l  +  Gi^iKjJ 


ri  + 


1 


1  +  GKj,i 


He 


Proof.  See  Appendix  I.  □ 

Remark 

Notice  that  in  Theorem  2  it  is  necessary  that  Kjj  stabilizes  G  when  the  system  identification 
procedure  is  carried  out.  This  can  be  ensured  by  increasing  the  closed-loop  bandwidth 
smoothly  and  cautiously  in  the  controller  design  stages  (to  be  described  in  Section  6).  We 
would  always  detect  a  gradual  degradation  of  performance  robustness  (while  stability  is  still 
being  maintained  and  the  system  identification  procedure  is  being  carried  out)  before  the 
closed-loop  system  lost  stability. 


30 


ADAPTIVE  ROBUST  CXINTROL 


193 


Suppose  that  the  value  of 

_ GjKjj  II  ^  jgj 

1  +  GKj,i  1  +  GiKjj  II- 

has  become  large.  As  it  was  described  in  Section  3,  we  want  a  new  identification  of  G  via  G,  +  i 
for  which 

GKj,i _ Gi  +  \Kj,i  II  ^2Q) 

l-^GKj.i  l  +  G,  +  i/^j.,||- 

is  small.  We  are  going  to  use  the  r^.j-parametrization  of  Gi+i.  By  substituting  equations  (11) 
and  (17)  into  expression  (20)  and  noting  that 

we  can,  after  simplification,  conclude  that 

(21) 

1  +  LiJijJ  1  +  Lfi+lAy,/  00 

should  be  small. 

By  using  equations  (13),  (18)  and  (21),  we  immediately  see  that  (for  the  system  identiheation 
procedure  relevant  to  the  windsurfer  approach  to  adaptive  control)  the  appropriate  signal 
model  is 


where 


/3i  =  Rjjoci  +  V 


01  =  Yj,ip 
ai  =  Yj,ia 


and  V  is  the  term  related  to  the  disturbance  e. 


Remarks 


(i)  Note  that 


\  +  GKjj 

is  the  actual  closed-loop  transfer  function  of  the  system  and 

^  GiKjj 

1  +  GiKjj 

is  the  nominal  closed-loop  transfer  function  of  the  system.  Therefore,  using  similar 
substitutions  to  these  that  resulted  in  equation  (21),  we  can  obtain 

TjJ  -  fj,i  =  Yj,iXjARj,i  “  Rjj)  (25) 


However,  since 


Rjj  =  0  vy,  V/ 
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we  therefore  have 

Tj,i=  Yj.t^j.iRj.i  (26) 

By  comparing  the  argument  of  the  //.-norm  given  in  expression  (19)  with  the  left-hand  side 
of  equation  (26),  we  see  immediately  that  when  the  value  of 

GKjj  GiKj.i 

1  +  GKj,i  1  +  GiKj,i  CO 

has  become  large,  i.e.  when  the  closed-loop  properties  of  the  actual  system  (Ty.i)  are 
significantly  different  from  the  closed-loop  properties  of  the  nominal  system  (f,.,),  the  vdue  of 

II  lloo 

will  be  large. 

(ii)  From  the  signals  defined  in  Theorem  2,  we  observed  that  Rj,i,  the  transfer  function  to 
be  identified,  is  excited  by  the  signal  a,  where 


Since  the  nominal  closed-loop  transfer  function  of  the  system  is 

^  _  GjKjj 


1  +  GiKjj 


we  can  write 


Therefore  Xjj  will  have  large  magnitude  when  we  try  to  push  the  nominal  closed-loop 
bandwidth  beyond  the  nominal  open-loop  bandwidth.  Since  a  model  usually  has  its 
uncertainties  become  significant  for  frequencies  beyond  its  bandwidth,  from  Figure  5  we  see 
that  if  the  spectrum  of  r i  is  white,  we  automatically  get  the  right  weighting  for  the  input  to 
Rj,i  for  the  system  identification  scheme. 
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(iii)  It  is  shown  in  Theorem  2  that  the  effect  of  e  on  J  is  given  by 

He 

I  +  GKj,i 

Notice  that  this  is  the  effect  of  e  on  y  attenuated  by  the  sensitivity  function  of  the  actual 
closed-loop  system. 


6.  APPROXIMATE  IDENTIFICATION  OF  THE  Rj^i  TRANSFER  FUNCTION  FOR 

IMC  CONTROLLER  DESIGN 

In  Section  5  we  have  shown  that  the  closed-loop  system  identification  of  the  plant  transfer 
function  G  can  be  reformulated  into  an  open-loop  system  identification  of  the  stable  proper 
transfer  function  Rjj  that  parametrized  the  transfer  function  G  via  the  equation 

Ni^RjjYjj 

Di  -  Rj.iXjj 

In  this  and  the  following  sections  we  shall,  for  simplicity,  study  the  case  where  the  plant 
transfer  function  is  stable  strictly  proper  and  has  no  zeros  on  the  imaginary  axis  of  the  5-plane 
and  where  the  IMC  method"^  is  used  to  design  the  controller  Kjj.  We  shall  also  assume  that 
all  estimates  G/  of  the  plant  are  stable  strictly  proper  transfer  functions. 

If  the  model 


r 


is  stable,  we  can  let  M  =  G,  and  £»;  =  1  so  that 

G  =  C,  + 


1  ~  ^J.iQj.i 


(27) 


where  Qj.i  is  a  stable  strictly  proper  transfer  function  that  parametrized  the  strictly  proper 
controller 


and 


Yj.i 


Qj/-' 


Kj.i 


1  +  GiKj. 


(28) 


The  reason  for  requiring  Qj,i  and  hence  Kjj  to  be  strictly  proper  is  that  this  is  a  necessary 
condition  for  the  system  to  be  robust  in  the  presence  of  high-frequency  parasitic  or  singular 
perturbation.^^  We  also  have 

Xj.i  =  Qj.i  Yj.i=\-Qj.iGi 

Since  the  parametrization  of  G  by  Rj,i  depends  intimately  on  Qjj,  we  shall  briefly  explain  how 
Qj,i  is  obtained  in  the  design  of  the  controller  Kjj.  We  will  use  the  notations  nn  and  du  to 
denote  the  numerator  polynomial  and  the  denominator  polynomial  respectively  of  a  rational 
transfer  function  H. 
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Given  a  stable  model 


where  dc,  has  no  zeros  in  the  closed  right-half  s-plane,  if  tia,  has  no  zeros  on  the  imaginary 
axis  of  the  s-plane,  we  can  write 


y-t _ ^Gi  rij  (Zi  s) 

where  all  Zi  have  positive  real  parts  and  fie,  has  no  zeros  in  the  closed  right-half  5-plane.  By 
writing  G,  as 

G,  =  [C,]„[G,]. 


where 


[G,]m 


(z*  is  the  complex  conjugate  of  z/) 


fie,  n,  (z,*  +  5) 
dc, 


[G,]a 


n,-  (z,  -  5) 
IT/  (z  *  +  5) 


we  have  factored  G,-  as  a  product  of  its  minimum  phase  factor  [G,]m  and  the  associated  all-pass 
factor  (Gi]a.  We  can  find  the  controller 


Kjj  =  arg  min 
y 


Gi7 
1  -t-  Giy 


-7l.i 


with  Tjj  specified  as  T^j—  -/v./fG/lai  by  designing  a  detuned  //2-optimal  controller  for  a  step 
reference  input  using  the  internal  model  control  (IMC)  approach.''  This  is  achieved  by  setting 


Qj.i=  [G,]™‘F,-., 

where  Fj,-,  is  a  lowpass  filter  of  the  form 


(29) 


n  is  the  relative  degree  of  the  model  G,-  and  Xy,,  is  selected  small  enough  so  that  Kjj  robustly 
stabilizes  Gi. 

In  the  ideal  situation  where  G,-  =  G,  the  nominal  and  actual  closed-loop  transfer  functions 
of  the  system  are  equal  and  are  given  by  the  transfer  function  Fy.,  [G,],.  Therefore  Xy.,  is  both 
the  nominal  and  actual  closed-loop  system  bandwidth  with  a  -  3(/i  +  1)  dB  attenuation.  In 
general,  Gj  G  and  Xy.i  serves  only  as  an  approximate  bandwidth  of  the  actual  closed-loop 
system. 

With  the  controller  designed  using  the  above  procedure,  we  shall  now  show  that  the  transfer 
function  to  be  identified,  is  the  product  of  a  known  stable  proper  transfer  function  and 
an  unknown  stable  strictly  proper  transfer  function.  An  analysis  of  the  form  of  the  unknown 
factor  in  Rj,i  indicates  how  it  can  be  sensibly  approximated  by  a  low-order  transfer  function. 
We  shall  first  rewrite  equation  (27)  as 


G-G, 

1  +  QjAG  -  G,  ) 


(30) 
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Then  we  can  obtain,  after  substituting  equations  (28)  and  (29)  into  equation  (30)  and 
performing  some  algebraic  manipulations, 


Rj,i —  (  [^7i]m(^  "t"  ^j,i 


(■y+  X>.,)(do,nc  -  rfcnc,)~j 

dKjjdc  +  nKj_,nc  ) 


(31) 


Note  that  equation  (31)  can  also  be  written  as 


(32) 


where 

[GiU{s+\j,ir  (33) 

is  a  known  stable  proper  transfer  function  and,  other  than  the  factor  s  +  \j,i  in  the  numerator, 

^ Xy.i)(cfc,nc-  done,) 
dxjjda  +  nKj^nc 

is  an  unknown  stable  strictly  proper  transfer  function  that  depends  on  the  unknown  transfer 
function  G.  Therefore  the  problem  of  identifying  Rj,i  has  become  one  of  identifying  its 
unknown  factor  Rjj.  We  shall  summarize  this  important  result  in  the  following  theorems. 


Theorem  3 

Consider  a  plant  which  has  an  unknown  stable  strictly  proper  transfer  function  G  and  a 
model  with  a  known  stable  strictly  proper  transfer  function  G/.  If  G  and  Gi  have  no  zeros 
along  the  imaginary  axis  of  the  5-plane  and 

G/=  [G/]ra[Gi]a 

where  [Gi]m  is  the  minimum  phase  factor  of  Gi  and  [Gi]a  is  the  allpass  factor  of  Gi,  then  with 


and 


where  n  is  the  relative  degree  of  Gi,  the  controller 

_ Qll 


Kj.i 


1  -  Qj.iGi 

will  robustly  stabilize  Gi  for  all  sufhciently  small  values  of  Xy,,-  ^  0. 


Proof.  See  Chaps  4  and  5  of  Reference  4. 


□ 


Theorem  4 

Let  the  controller  be  designed  according  to  the  conditions  stated  in  Theorem  3;  then  the 
unknown  stable  strictly  proper  transfer  function  to  be  identified, 


Rj.i- 


G-Gi 

1  +  QjAG  -  Gi) 


198 


W.  S.  LEE  ETAL. 


can  be  factorized  as 

^j.i  = 

where  Rjj  is  an  unknown  stable  strictly  proper  transfer  function  to  be  identified  and  Rjj  is 
a  known  stable  proper  transfer  function  given  by 

where  is  the  nominal  closed-loop  system  bandwidth  with  a  “  3(/2  +  1)  dB  attenuation. 

Furthermore,  the  order  and  the  relative  degree  (rel  deg)  of  the  transfer  function  J^j,i  are 
respectively  given  by 

(order  of  Rjj]  =  (order  of  G]  +  (order  of  Gi)  -  (M+  A^)  +  1 
rel  deg {/?;./)  =  min(rel  deg(G},  rel  deg(G/)) 

where  M  is  the  number  of  common  zeros  in  G  and  G/  and  N  is  the  number  of  common  poles 
in  G  and  G/. 

Proof.  See  Appendix  II.  □ 


Remarks 

(i)  Note  that  the  factorization  of  Rjj  given  in  Theorem  4  is  naturally  induced  by  the  IMC^ 
controller  design  procedure  that  we  have  adopted. 

(ii)  The  poles  of  Rjj  are  the  poles  of  Ty,!,  the  actual  closed-loop  transfer  function  of  the 
system, 

(iii)  It  is  important  to  note  that  /?>,/  =  0  if  and  only  if  G  =  G/. 

(iv)  The  order  of  Rj,i  depends  on  the  order  of  G,  which  is  an  unknown. 

(v)  Although  Hansen’s  approach  enables  us  to  obtain  an  unbiased  estimate  of  the  transfer 
function  Rj,i,  it  should  be  noted  that  Rjj  has  more  parameters  to  be  estimated  than  G. 
Furthermore,  since  the  order,  hence  the  number  of  parameters  to  be  identified  in  increases 
while  the  magnitude  of  Rj,i  decreases  with  the  stages  of  iteration,  we  would  expect  that  under 
noisy  conditions  the  system  identification  problem  will  become  harder  as  the  iteration  process 
progresses.  There  is  an  obvious  analogy  in  the  windsurfing  situation.  The  better  is  the  skill  of 
a  windsurfer,  the  harder  it  will  be  for  him/her  to  improve  his/her  skill  further.  In  fact,  it  will 
take  a  long  time  under  extreme  conditions  to  improve  his/her  skill.  In  the  system  identification 
problem  for  Rjj  the  interpretation  is  that  strong  probing  signals  and  a  long  record  of 
measurements  are  necessary  to  achieve  even  a  slight  improvement  if  the  closed-loop  already 
has  good  performance  and  large  bandwidth. 

Since  we  do  not  know  the  order  of  Rjj  a  priori  and  since  we  are  going  to  identify  /?y,/ 
(actually  Rjj)  and  update  G/  to  G,*+i  when  the  step  response  of  the  actual  closed-loop  system 
exhibits  unacceptable  oscillations  and/or  overshoots  (associated  with  model  uncertainties),  we 
expect  Rj,i  to  have  complex  conjugate  poles.  Therefore  the  transfer  function  which  serves  as 
an  approximation  of  Rj,i  has  to  have  an  order  of  at  least  two.  Moreover,  since  the  smallest 
possible  relative  degree  of  a  strictly  proper  transfer  function  is  one  and  the  relative  degree  of 
G  is  unknown,  we  have  to  assume  that  the  relative  degree  of  Rj,i  could  be  one.  It  was  shown 
in  equation  (21)  that  the  system  identification  problem  is  to  find 

0./  =  arg  min  I1  -  a)ll« 

a 


(35) 
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If  we  define 


rj.i  =  Rj.irj.i 


(36) 


where  fjj  is  an  unknown  second-order  stable  strictly  proper  transfer  function,  then  by 
substituting  equations  (32)  and  (36)  into  equation  (35),  we  can  show  that  the  system 
identification  problem  becomes  one  of  finding 


rj.i  =  arg  min  |1  -  *^)1U 

* 

Therefore,  for  the  purpose  of  identifying  Rj.i,  the  signal  model  can  be  obtained  by 
appropriately  modifying  equation  (22)  and  is  given  by 


/3i  =  Rj.ioti  +  V  (38) 

where 

OLl-Rj/i(X\ 

and  ai,  |3i  and  v  have  been  defined  previously.  Notice  that  the  signals  /3i  and  in  the  model 
described  by  equation  (38)  can  easily  be  generated,  using  known  filters,  from  the  control  input 
u,  the  measured  output  y  and  the  reference  input  n. 


Remarks 

(i)  Since  Yj,i  is  the  nominal  sensitivity  function  of  the  closed-loop  system,  we  immediately 
see  that  the  frequency  shaping  in  the  identification  criterion  given  by  equation  (37)  will  force 
the  updated  model  to  have  small  modelling  error  at  the  edge  of  the  closed-loop  bandwidth 
where  the  nominal  sensitivity  function  cannot  be  made  small  by  the  controller  Kj.t. 

(ii)  It  is  important  to  ensure  that  the  input  is  sufficiently  exciting  when  we  are  carrying  out 
a  system  identification  experiment. 

(iii)  Under  noisy  conditions  the  signals  to  be  used  in  the  system  identification  process  should 
be  appropriately  lowpass  filtered.  In  a  discrete  time  implementation  this  can  be  accomplished 
by  an  anti-aliasing  filter, 

(iv)  When  updating  the  model  using  the  equation 


Gi  +  i  =  Gi  + 


n,i 

1  “ 


the  order  of  the  model  may  increase.  To  prevent  the  model  order  from  increasing  indefinitely, 
we  use  a  frequency-weighted  balanced  truncation  scheme’*  to  reduce  the  order  of  G,  +  i. 
Specifically,  we  find 


Gi  + 


=  arg  min 


Gi  +  \Kj/t 

vKj.i 

1  +  Gr  +  \Kj^i 

1  + 

where  (5,  +  i  is  the  reduced-order  model.  If  the  model  order  is  restricted  to  m,  the  controller 
will  be  at  most  of  order  m  +  1  (see  controller  design  equations  given  in  Theorem  3).  In  this  way 
the  controller  complexity  will  be  limited. 
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7.  SIMULATION  RESULTS 

With  reference  to  Figure  3,  we  shall  present  some  simulation  results  of  applying  the  windsurfer 
approach  to  the  control  of  a  system  with 


and  e  as  zero-mean  disturbance  with  a  constant  energy  density  of  0-0025  from  0  to  100  Hz. 
We  first  summarize  the  procedure  in  the  following  algorithm. 


Step  1 

Set  G,  =  Go,  where  Go  is  the  transfer  function  of  an  initial  model  of  the  plant. 


Step  2 

Factorize  G,  as 


G,=  [G,U[G,]a 

where  [G,]„  is  the  minimum  phase  factor  of  G,  with  a  relative  degree  of  n  and  [G,]*  is  the 
associated  allpass  factor  of  G,-. 


Step  3 

For  j  =  1  find 


Kjj  = 


Qj.i 


1  Qj.iGi 


with 


Qj.i= 

where  the  parameter  Xy.,  in  the  transfer  function 


IS  chosen  such  that  robustly  stabilizes  G,-  in  the  sense  that  the  filtered  (noisy)  step  response 
of  the  actud  closed-loop  system  has,  at  most,  few  oscillations  and/or  overshoots.  Stop  here 
If  such  a  robust  stabilizing  controller  cannot  be  found.  Also  stop  here  if  the  robust  stabilizing 
controller  results  m  a  closed-loop  system  which  meets  the  specified  bandwidth.  Otherwise 
proceed  to  the  next  step. 


Step  4 

Let  y  =  y+  1  and  set  \jj  =  +  e  for  small  e  >  0  and  redesign  the  controller  Kj  /  using 

the  equations  given  in  Step  3.  Stop  here  if  the  design  produces  a  robust  stabilizing  controller 
with  the  closed-loop  system  satisfying  the  specified  bandwidth.  Otherwise  repeat  this  step  if 
Kj,i  robustly  stabilizes  G,;  else  proceed  to  the  next  step. 
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Step  5 

Perform  control-relevant  system  identification  to  obtain  For  this  purpose  we  apply  an 
algorithm  such  as  least  squares  to  obtain  an  estimate  fjj  of  which  satisfies 

/3i=/?>./a2  +  u 

This  depends  on  using  the  signals 

^  ~~  02  —  I 

(We  actually  used  discrete  time  samples  of  /3i  and  02  and  an  output  error  algorithm  to 
construct  a  strictly  causal  second-order  estimate  from  which  a  continuous  time  strictly  proper 
fjj  was  obtained.)  Using  rjj^  the  model  is  updated  via  the  following  set  of  equations: 


Gi+i  =  Gi  +  Y 


£Jd _ 

n.iQhi 


Step  6 

If  G/+1  is  stable,  find  the  reduced-order  model 
Gi  +  i  =  arg  nyn 

Otherwise  stop  here. 


vKj.i 

1  +  GiJ^iKjj 

1  +  rjKjj 

Step  7 

Set  Gi  =  (5i  +  i  and  return  to  Step  2. 


Remarks 


(i)  In  the  algorithm,  system  identification  has  to  be  carried  out  when 

11  Ts.i  -  Ts,i  lU 

is  no  longer  small.  Broadly  speaking,  this  will  correspond  to  a  significant  difference  between 
the  designed  nominal  performance  (depending  on  G,*  and  Ksj)  and  the  actual  performance 
(depending  on  G  and  Ks,i)>  In  particular,  the  observed  step  response  may  exhibit  many  more 
oscillations  and/or  overshoots  than  the  designed  values.  This  is  not  of  course  the  same  thing 
as  guaranteeing  that  the  i/«  error  above  has  became  large,  but  neither  is  it  unrelated.  To  be 
more  precise,  we  define  the  peak  gain  of  a  system  whose  transfer  function  is  T  by 


11  T\U  = 


Tw\\, 
sup  \ — iP 

•*’11.*' 0  w  - 


This  is  also  equal  to  the  total  variation  of  the  system’s  unit-step  response*^  (roughly  the  sum 
of  all  consecutive  peak-to-valley  differences  in  the  unit-step  response).  It  can  be  shown that 
if  r  is  a  stable  strictly  proper  transfer  function,  then 

liriu<||r|i,^2p||r|U 

where  p  is  the  order  of  the  transfer  function  T.  Now  we  consider  the  peak  error 


ll7w.,--rN.ill. 


3^ 
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Since 

\\TN.i-fN.i\U>\\TN.i\U-\\fN.i\U 

then  if  the  observed  step  response  of  T/v.,-  exhibits  many  more  oscillations  and/or  overshoots 
than  the  designed  step  response  of  we  would  expect 

and  hence 

II  —  TV./  II 1  >  r,  £  >  0 

Since  the  peak  gain  also  provides  a  loose  lower  bound  for  the  gain,  it  is  likely  that 

becomes  large  when  the  observed  actual  step  response  exhibits  many  more  oscillations  and/or 
overshoots  than  the  desired  one.  This  explains  why,  in  the  simulation,  the  models  are  updated 
whenever  the  filtered  (noisy)  actual  step  response  exhibits  unacceptable  oscillations  and/or 
overshoots. 

(ii)  The  algorithm  used  to  obtain  an  estimate  fj,-,  of  Rj,,  cannot  be  expected  to  give  an 
optimal  //oo  estimate.  However,  note  that  efficient  algorithms  for  performing  system 
identification  are  still  lacking  and  the  corresponding  theory  is  still  not  well  understood.^*"” 

(iii)  Since  the  stability  robustness  of  the  closed-loop  system  for  each  \j,i  has  to  be  checked 
by  using  step  response  testing,  the  method  is  not  an  on-line  procedure.  In  fact,  at  this  stage 
of  development  it  is  an  off-line  iterative  identification  and  control  design  procedure. 

The  simulation  results  are  presented  in  Figures  6-8.  We  start  with  an  initial  model  which 
has  the  transfer  function 


In  all  these  figures  the  graphs  on  the  left  show  the  noisy  unit-step  responses  of  the  actual 
closed-loop  systems  and  those  on  the  right  show  the  corresponding  lowpass-filtered  signals. 
Graphs  (a)  and  (b)  of  Figure  6  show  the  responses  of  the  actual  closed-loop  system  with  a 
nominal  bandwidth  of  O'l  rads  '.  Note  that  overshoots  and  oscillations  are  absent  for  the 
response  in  graph  (b).  Graphs  (c)  and  (d)  of  Figure  6  are  for  a  nominal  closed-loop  bandwidth 
of  0-5  rads"*.  Note  that  the  response  in  graph  (d)  is  oscillatory  and  any  attempt  to  increase 
the  nominal  closed-loop  bandwidth  further  is  likely  to  lead  to  instability.  At  this  stage  it  is 
necessary  to  improve  the  accuracy  of  the  model  if  we  wish  to  increase  the  nominal  closed-loop 
bandwidth  further.  To  ensure  that  the  signals  are  sufficiently  exciting,  low-amplitude  sinusoids 
in  the  relevant  frequency  range  are  superimposed  on  the  unit-step  input  just  prior  to  system 
identification.  The  responses  are  shown  in  graphs  (a)  and  (b)  of  Figure  7.  The  updated  model 
has  a  transfer  function 

G  -  0'Q625285^-- 0-339685 -HO -279 

The  updated  model  Gi  is  used  to  redesign  a  nominal  closed-loop  system  with  a  bandwidth 
of  0-51  rads  and  the  responses  are  shown  in  graphs  (c)  and  (d)  of  Figure  7.  By  comparing 
graph  (d)  of  Figure  7  with  that  of  Figure  6,  we  observe  that  the  response  no  longer  has 
oscillations.  We  also  notice  that  the  rise  time  in  graph  (d)  of  Figure  7  is  about  twice  that  in 
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Figure  8.  Simulation  results  3 


graph  (d)  of  Figure  6.  Since  both  Go  and  Gi  have  the  same  relative  degree  n=l,  we  would 
expect  graph  (d)  of  Figure  6  and  graph  (d)  of  Figure  7  to  be  similar  to  the  unit-step  response 
of  the  nominal  closed-loop  transfer  function  [0 •  5/(5  +  0-5)]  ^  By  comparing  with  the 
computed  unit-step  response  of  the  transfer  function  (0 •  5/(5 -i- 0-5)]  ^  we  have  verified  that 
graph  (d)  of  Figure  7  is  very  close  to  the  desired  one.  If  we  continue  to  increase  the  nominal 
closed-loop  bandwidth  of  the  system,  we  obtain  the  responses  shown  in  Figure  8,  where  graphs 

(a)  and  (b)  are  for  a  bandwidth  of  1  rads”*  and  graphs  (c)  and  (d)  are  for  a  bandwidth  of 
2  rads  . 

The  frequency  responses  of  G,  Go  and  G,  are  presented  in  Figure  9.  Notice  that,  compared 
with  Go,  the  updated  model  G,  has  effectively  captured  the  effects  of  the  poorly  damped 
resonance  of  the  plant. 

For  the  purpose  of  comparison  we  present  in  Figures  10  and  11  the  corresponding  results 
obtained  using  the  procedure  described  in  Reference  10.  Recall  that,  as  we  have  mentioned  in 
Section  1,  these  are  obtained  under  noiseless  conditions  using  rational  function 
approximations  (in  the  sense)  of  the  plant  instead  of  identified  models.  It  is  also  important 
to  emphasize  that,  instead  of  strictly  proper  controllers,  proper  but  non-strictly  proper 
controllers  are  used  in  the  procedure  described  in  Reference  10. 

To  facilitate  comparison,  we  adopt  the  same  initial  model 


Graphs  (a)  and  (b)  of  Figure  10  show  the  unit-step  responses  of  the  actual  closed-loop  system 
for  nominal  closed-loop  bandwidths  of  0-02  and  0-04  rad  s”*  respectively.  Note  that  graph  (b) 


units 
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Graph  (c) 


Graph  (d) 
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Figure  1 1 .  Simulation  results  using  rational  approximations  of  the  plant 


of  Figure  10  shows  significant  oscillations.  After  the  model  is  updated  to 

_  -0-40945^ +  2-05725 +7- 175  ■ 

‘  5"  +  1-30275^  +  8-99085+  10-6411  I 

using  the  procedure  described  in  Reference  10,  the  unit-step  response  of  the  actual  closed-loop 
system  for  a  nominal  closed-loop  bandwidth  of  0-04  rads" ‘  is  improved  and  it  is  shown  ini 
graph  (c)  of  Figure  10.  Graph  (d)  of  Figure  10  shows  the  unit-step  response  when  the  nominal^ 
closed-loop  bandwidth  is  increased  to  0- 1  rad  s"‘.  Graphs  (a)  and  (b)  of  Figure  1 1  are  obtained 
for  nominal  closed-loop  bandwidths  of  0-5  and  1  rads"’  respectively  when  the  model  is  Gi.|| 
If  the  model  is  improved  to  I 

_  -0-406125^0-801965  +  6-3884 
^  5’ +  1  - 09775"  +  8  -  8825  +  9 -  3027  ■ 

the  unit-step  response  is  given  by  graph  (c)  of  Figure  11  for  a  nominal  closed-loop  bandwidth! 
of  1  rads"'.  When  the  nominal  closed-loop  bandwidth  is  increased  to  2  rads"',  the  unit-step 
response  is  as  shown  in  graph  (d)  of  Figure  11.  ■ 
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All  else  being  equal,  we  would  expect  the  noiseless  situations  to  give  better  results  than  the 
noisy  conditions.  However,  by  comparing  the  results  given  in  Figures  6-8  with  those  given  in 
Figures  10  and  11,  we  observed  that,  overall,  the  results  given  in  Figures  6-8  appear  to  be 
better  than  those  given  in  Figures  10  and  1 1 .  Therefore  we  can  conclude  that  strictly  proper 
controllers  are  less  sensitive  to  high-frequency  model  uncertainties  and  hence  require  less 
frequent  model  updates  when  we  attempt  to  increase  the  nominal  closed-loop  bandwidth  of 
the  system.  This  is  important,  because,  as  we  have  mentioned  before,  under  noisy  conditions 
the  system  identification  process  is  becoming  progressively  difficult  and  it  is  advantageous  to 
be  able  to  have  infrequent  but  accurate  model  updates. 

8.  DISCUSSION  AND  CONCLUSIONS 

We  have  reviewed  in  Section  1  the  strength  and  weakness  of  both  the  traditional  adaptive 
control  and  the  robust  control  design  methods.  These  methods  should  be  able  to  complement 
each  other  and  there  should  be  natural  ways  in  which  they  could  be  blended  harmoniously. 
We  proposed  that  one  of  the  possible  ways  is  by  the  windsurfer  approach  which  was  first 
mentioned  in  Reference  13.  We  have  shown,  by  simulation,  that  by  starting  with  a  (crude)  initial 
model  of  the  plant  and  a  (small-bandwidth)  robustly  stabilizing  controller,  the  bandwidth  of 
the  closed-loop  system  can  be  increased  progressively  through  an  iterative  control-relevant 
system  identification  and  control  design  procedure.  We  shall  highlight  the  following  points 
which  we  believe  are  reasons  for  the  success  of  the  approach. 

(i)  The  use  of  control-relevant  frequency  weighting  in  the  system  identification  criterion. 

(ii)  Updating  of  the  model  when  the  effects  of  its  error  are  no  longer  small  in  the  closed- 
loop  response.  This  will  ensure  that  model  uncertainties  are  emphasized  in  the  correct 
range  of  frequencies. 

(iii)  The  controller  designed  by  using  the  IMC  method  always  has  integral  action.  Therefore 
it  is  insensitive  to  model  uncertainties  at  low  frequencies  provided  that  the  gain  of  the 
model  at  low  frequencies  is  of  the  right  sign. 

(iv)  The  use  of  strictly  proper  controllers  to  reduce  the  required  number  of  model  updates 
through  identifying  the  Rj.-,  transfer  function,  a  difficult  task  under  noisy  conditions. 

(iv)  The  controller  designed  by  using  the  IMC  method  induces  a  natural  factorization  in  the 
parametrization  of  the  unknown  transfer  function  of  the  plant.  This  enables  the  system 
identification  problem  to  be  solved  efficiently. 

It  is  natural  in  any  discussion  of  adaptive  or  iterative  design  to  raise  the  question  of 
convergence.  We  consider  that  the  work  of  this  paper  implicitly  established  the  following 
practical  convergence  results,  which  are  confirmed  by  all  the  simulations. 

1.  Assume  that  a  certain  closed-loop  bandwidth  can  be  achieved  for  a  known  stable  plant. 
In  the  situation  where  the  same  plant  is  imperfectly  known,  it  is  possible  to  adapt  the 
closed-loop  system  by  the  method  described  such  that  the  same  closed-loop  bandwidth 
is  achieved,  assuming  that  the  noise  is  not  so  great  as  to  preclude  satisfactory 
identification. 

2.  For  a  specified  (fixed)  closed-loop  bandwidth  which  is  achievable  with  the  real  plant,  the 
actual  closed-loop  behaviour  approaches  very  closely  the  nominal  closed-loop  behaviour. 

In  conclusion,  we  would  like  to  emphasize  that  only  the  case  of  a  stable  plant  with  stable 
models  is  considered  in  this  preliminary  investigation.  We  hope  to  address  the  following 
problems  in  the  near  future. 
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(i)  How  to  come  up  with  an  initial  stabilizing  controller.  Simple  cases  where  the  plants 
have  single  or  double  poles  at  the  origin  have  already  been  addressed. 

(ii)  The  extension  of  the  method  to  deal  with  unstable  plants  or  models. 

(iii)  Use  of  orthogonalized  exponential  in  the  system  identification  procedure  such  that  it 
becomes  a  convex  optimization  problem. 

(iv)  To  prove  that  the  algorithm  actually  converges  in  some  sense. 

(v)  To  study  other  control  design  methods  in  the  context  of  the  windsurfer  philosophy.  . 


APPENDIX  I:  PROOF  OF  THEOREM  2 


Since  the  controller 


stabilizes  the  model 


Kja  = - 

Yj.i 


Ni 


Gi=-^.  NiXj.i+DiYj.i=l 
Di 

then  solving  equations  (40)  and  (41)  simultaneously,  we  get 

Kj.i 


Xjj  = 


Di  +  NiKj., 


1  V  l-NiXj, 

jji  ■  or  *  j.i . . . 

Di  +  NiKj.i  Di 


Substituting  Xjj  and  Yjj  into 


will  result  in 


Gi  +  i  = 


Nj^rjjYjA 

Di 


(40) 

(41) 

(42) 

(43) 


Gi+i  =  Gi  + 


rjj 


DiiJDi 


Solving  for  o,,-,  we  get 


Df(,Gi^i-Gi) 

l+DiXjAGi*i-Gi) 


From  Figure  3  in  Section  5,  with  ri  =  0,  we  can  write  for  the  closed-loop  system 

OKjj  1  ,,  Kja  Kja  „ 

y  ^  ;.j  4.  ^  .  He,  u  =  rx  -  7 - 1,.  He 


1  +  GKja  1  +  GKja 
Therefore  we  can  write  the  equation 


1  +  GKja  I  +  GKj, 


as 


and  the  equation 


0  =  Diy  ~  NiU 

DiKjAG-Gi)^  ^  Di(l  +  G,Kj,i) 
^  l+GKj,i  '  i  +  GKj,i 

a  =  Xjjri 


Kj.i 

a- - - - fi 

Did  +  GiKj.i) 


(44) 


(45) 


(46) 
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If  wc  form  the  output  error  defined  by 

r— /S  —  o.jof  (47) 

then  by  substituting  equations  (44)— (46)  into  equation  (47)  and  using  the  expression  for  Xjj  given  by 
equation  (42),  we  can  obtain 

_Dii\^GiKj^i)KjAG-Gi^i)  ^  (48) 

^  (l  +  GKj,,)(l+OutKj,i)  ‘  l+CKj., 

Since  equation  (43)  can  also  be  written  as 

Y  1 

D,{l+CiKj.i) 

it  is  clear  that  if  we  define  the  filtered  output  error  as 

k=Yj,,e 

then 


(  = 


+  l+GKj,i 


or 


\1  +  GKj,i  1  +  Gi*\Kj,i)  1  +  GKj,i 


□ 


Using  the  notations  established  in  Section  6,  we  have 

G-Gi 


APPENDIX  II;  PROOF  OF  THEOREM  4 

Kj., 


Ru=- 


+  QjAG-Gi)' 


Qj.i  = 


1  +  G,Kj,t 


Therefore  we  can  write 


We  also  have 


1  +  GiKj,i  „  . 


G.  =  lGi]„lG,l. 


where 


[G,]m  = 


nqFli  (^i  ^) 


[G,].= 


n<  (2<  -  s) 
n,-(z*+s) 


Since  Qj,t  =  [G<];‘F/.<,  we  can  rewrite  the  equation 

Kj.i  = 


Qjj 


as 


Hence  we  can  write 


Kj.i  =  - 


1  -  Qj.iGi 
dc,nF 


«c,[ni  (z,*  +  s)dFj,,  -  n,  (Zi  -  s)nFi.A 


\  +  GiKj,i  =  - 


n,  (Zj*  +  S)dFu 


(49) 


n,  (z  *  +  s)dFj.i  -  n,  (z.  -  s)nFj,, 
By  substituting  equation  (50)  into  equation  (49)  and  noting  that 


(50) 


dodo, 
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we  obtain 


where 

~  l^^ilin('S‘  +  Xy,t) 

is  a  known  stable  proper  transfer  function  and,  other  than  the  factor  s  +  Xy.,  in  the  numerator, 

^  (^  +  \j.i)(dG,nc  —  dgnci) 

dKj,idG  +  nKj,,nc 

is  an  unknown  stable  strictly  proper  transfer  function. 

To  obtain  the  results  on  the  order  and  relative  degree  of  Rj.i  we  shall  write 


[GiU  = 


[Gil.= 


T,(S)’  Pi{s) 

where  each  of  the  polynomials  ij.(s),  ir/(s)  and  pi(s)  has  degree  r,  «  +  r  and  m  respectively.  We  can  then 
obtain 

^ _ \l1'rds)pds) _ 

,,(5)1(5+ Xy./)"^'p/W- x;.fp.(-^)] 

If  we  also  write  G  as 

^  g(s) 

m 

where  a{s)  has  degree  p  and  0(5)  has  degree  q,  then  by  substituting  all  these  into  equation  (51).  we  get 

-  _ (5  +  \j,i)  [cx(5)X|(5)p/(5)  —  7?/(5)pi(  ^5)0(5)] _ 

""  0(5)77/(5)[(5+  Xi.,)"'"'pi(5)“  X;.t‘p/(-5)]  +  X;.T'a(5)T/(5)p/(5) 

By  counting  the  degrees  of  the  resulting  numerator  and  denominator  polynomials  of  Ajj  given  by 
equation  (52),  the  required  results  are  established  immediately.  □ 
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Unbiased  least  squa.res  estinia.tes  in  the  presence 
of  structure  incompatibilities 

Henk  Aling  ‘and  Robert  L.  Kosut 


Abstract  ;  When  the  system  identification  problem 
is  posed  in  a  stochastic  framework,  model  errors  are 
defined  as  the  sum  of  a  bias  and  a  variance  term.  The 
bias  term  is  caused  by  incompatibilities  between  the 
model  structure  used  with  the  identification  and  that 
of  the  true  system.  In  particular,  the  use  of  ARX 
models  almost  invariably  leads  to  bias  since  associated 
noise  model  has  no  zeros.  Results  are  known  on  the 
unbiasedness  of  the  first  n  Markov  parameters  of  n-th 
order  least  squares  ARX  model  estimates  for  systems 
that  are  excited  by  a  white  noise  input.  We  will  extend 
these  results  to  the  colored  input  case. 

Keywords:  Least  Squares,  Markov  Parameter,  Sys¬ 
tem  Identification,  Model  Reduction,  Unbiased  Esti¬ 
mation. 


Introduction 

Let  us  assume  that  we  want  to  identify  a  model  of  the 
following  linear,  time-invariant  discrete  time  system. 

yx  =  Gr(z)ut  4- 

Here,  yt  and  ut  are  the  observed  output  and  input 
sequences,  and  G  and  H  are  transfer  functions  in  the 
shift  operator  z  : 

G{z)  =  + 

H{z)  =  I-f  Hiz^^  4-  H2z-^  4- . . . 

G(z)  and  H{z)  are  general  in  the  sense  that  we  do  not 
assume  finite-dimensionality,  or  compatibility  with  an 
ARX  model  structure.  We  will  use  an  n-th  order  ARX 
model  to  identify  the  system: 

yt  4-  Aivt-i  4- ...  +  AnVt-Tv  =  Bout  + ...  +  4-  €t  (1) 

This  model  structure  is  represented  in  terms  of  trans¬ 
fer  functions  as 

yx  -  A{z)~'^ B[z)ut  4-  A{z)~^et 
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where 

t=l 

Since  the  all-pole  noise  model  structure  is  gen¬ 

erally  incompatible  with  the  noise  system  H{z)  ,  any 
estimate  G{z)  =  A{zy^B{^)  of  G(z)  will  be  biased. 
This  bias  can  be  reduced  significantly  by  increasing 
n  ,  thereby  allowing  more  degrees  of  freedom  to  fit  the 
noise  model.  Thanks  to  the  robustness  properties  of 
the  least  squares  method  this  can  oftentimes  be  done 
without  severe  errors  due  to  overfitting.  However,  if 
the  excitation  is  poor,  large  errors  in  the  frequency  re¬ 
sponse  of  the  model  may  still  result  in  the  poorly  ex¬ 
cited  regions.  A  better  approach  towards  unbiased  es¬ 
timation  is  given  by  the  following  known  result,  based 
on  the  impulse  response  of  the  estimated  ARX  model 
[4,  2,  5,  6]: 

Theorem  1  Lei  Ut  and  Ct  both  be  independently ^ 
identically  distributed  sequences,  and  lei  ut  and  Ct 
be  uncorrelated.  Then,  as  the  number  of  data  points 
goes  to  infinity,  the  first  n  4-  1  Markov  parameters 
Go,  ^  ^-iGn  I  obtained  from  the  n-ih  order  ARX  model, 
are  unbiased  estimoit  of  the  true  Markov  parameters 
Go, . .  -  ,  Gn  • 

The  surprising  property  expressed  by  this  theorem  is 
that  the  first  n  4- 1  Markov  parameters  are  estimated 
biasfree  even  in  the  case  where  the  ARX  model  struc¬ 
ture  is  incompatible  with  that  of  the  tme  system.  This 
has  been  proved  for  the  noisefree  case  [4,  2]  as  well  as 
for  the  noisy  case  [5,  6].  We  will  extend  this  theorem 
to  the  case  where  ut  is  colored.  The  theorem  is  partic¬ 
ularly  useful  for  reduction  of  high  order  ARX  models 
based  on  the  model  impulse  response  [3,  7]  . 

Definitions  and  normal  equations 

We  will  derive  a  direct  relationship  between  the 
Markov  parameters  of  the  true  system  and  the  ARX 
model  parameters,  based  on  a  straightforward  evalua^ 
tion  of  the  normal  equations.  We  define  a  parameter 


5n) 

T 


matrix  ©  and  regression  vector  by 

0  =  (  ^  B  )  =  (  >li  Bo  • 

^,  =  {-,u  ...  «f  ...  „r_„) 

Hence,  we  can  write  (1)  as 

yt  =  0<i>t  +  €t 

The  least  square  solution  is  determined  by  the  normal 
equations 

exx^  =  YX'^  (2) 

where  Y  and  X  are  matrices  containing  the  output 
and  regression  data: 


=( 


yi 


Vn 


)  1  -X"  -  ^  (^1  ...  (I>N  ^ 


To  derive  the  relationship  between  the  ARX  model 
parameters  and  the  true  Markov  parameters  (Gt,  Hi) 
we  have  to  introduce  some  more  notation: 

g  =  (  Go  Gi  G2  —  •  ) 

h  =  {l  B,  B,  E,  ...) 

Vt 


It  =  (  uf  uf_i  uj_2  ...) 

Et  =  ef_j  ...) 

The  input /outpuV^el^onship  can  now  be  written  £ls 

Vi  =  gUi  +  hEt  (3) 

To  put  this  in  matrix  form  for  all  t  =  1, . . . ,  iV  we  have 
to  add  the  definitions 

lf={ui  U2  U3  ...  Un) 

^  ^  El  E2  E3  . . ,  En  ^ 

We  can  now  write  (3)  for  all  t  as 


=  (»  ^(r) 


For  the  regression  matrix  X  we  can  define 


Q  = 


H  = 


(  0 

Go  Gi 

...  Gn^i 

...  \ 

0 

0  Go 

.  .  O 

0 

0  Go 

...  y 

{  ° 

/  El  ...  Er.-l 

...\ 

0  I 

...  ffn-J 

... 

\  0  0  ... 


Figure  1:  Impulse  responses  of  ARX  models,  of  order 
3,6,9,12:  white  input  case. 

and  express  the  matrix  of  regression  vectors  as 


Here,  Z  is  an  identity  matrix  extended  with  zeros  to 
the  right.  The*  normal  equations  are  now  written  com¬ 
pactly  as 


/  UU^ 


(4) 


Unbiased  Markov  parameter  estimation 

The  first  main  result  is  theorem  1  which  been  stated 
already.  The  proof  is  a  straight  consequence  of  the 
normal  equations  (4)  and  is  given  in  the  appendix. 

As  an  example,  we  have  estimated  modeb  of  orders  3, 
6,  9  and  12  on  simulated  data  (generated  by  a  single 
input/single  output  4-th  order  system  with  white  noise 
as  input,  and  a  signal  to  noise  ratio  of  1).  Their  im¬ 
pulse  responses  are  shown  in  Figure  1;  the  solid  line  is 
the  impulse  response  of  the  true  model.  The  result  is  a 
good  illustration  of  the  theory:  the  impulse  responses 
of  all  models  arc  identical  up  to  t  =  3  ,  after  which 
the  third  order  response  starts  to  differ  from  the  rest  - 
we  see  two  separate  lines.  After  t  =  6  the  sixth  order 
response  starts  to  differ  so  that  we  sec  3  different  lines, 
and  so  on.  The  fit  of  the  first  n-f  1  Markov  parameters 
is  based  on  asymptotic  results  amd  is  therefore  exact. 

A  first  extension  to  the  non-white  input  case  is  given 
by  the  following  corollary  of  theorem  1  .  The  idea  is 


1 


Figure  2:  20-th  order  ARX  model  impulse  response  vs. 
true  model  based  on  prefiltered  input. 

to  describe  the  uj  as  the  output  of  a  filter  with  a  white 
sequence  Wt  as  input,  and  use  lOt  as  input  for  the  ARX 
model.  The  proof  is  given  in  the  appendix. 

Corollary  2  Let  V (s)  be  the  spectral  density  function 
of  ut  and  let  it  have  the  spectral  factorization  V(z)  = 
W{z)T.^^W{z-\Y  [1]  .  Define  Wt  =  ,  and 

f(z)  =  A{z)~^B{z)  as  the  n-ih  order  least  squares  es¬ 
timate  of  an  ARX  model  with  input  Wt  and  output 
Pi  .  Let  G(z)  =  = 

f(z)W(z)“^  .  Then,  as  the  number  of  data  points  goes 
to  infinity,  Go,  are  unbiased  estimates  of  the 

true  Markov  parameters  Go, . . . ,  Gn  • 

The  corollary  is  illustrated  by  figure  2.  The  figure 
shows  the  comparison  between  the  impulse  response 
of  a  20-th  order  ARX  model  (dashed  line)  estimated 
from  data  generated  by  a  4-th  order  model  (solid  line) 
observed  in  additive  white  noise  (output  error  struc¬ 
ture,  as  opposed  to  ARX  which  is  an  equation  error 
structure).  The  order  r  of  the  input  filter  with  both 
poles  and  zeros  was  chosen  to  be  6,  and  the  signal  to 
noise  ratio  was  equal  to  1. 

This  result  is  a  straightforward  consequence  of  theo¬ 
rem  1  and  requires  the  availability  of  the  signal  ti;t  . 
Due  to  the  definition  of  G(z)  ,  the  plant  model  or¬ 
der  will  generally  be  higher  than  n  ,  the  order  of  the 
combined  plant/filter  system.  It  would  be  nice  if  we 
could  relate  the  the  Markov  parameters  of  a  n-th  order 
ARX  model  with  input  ut  directly  to  the  true  Markov 
parameters,  as  in  theorem  1  .  The  following  theorem 
proves  that  this  is  possible  indeed,  if  the  input  is  an 
auto  regressive  filtered  white  noise.  The  proof  is  given 
in  the  appendix. 

Theorem  3  Let  ut  be  a  quasi- stationary  sequence 
with  spectral  density  function  V{z)  of  the  form  V[z)  = 
W(z)E„„Vr(z-i)’’  _  >  0  an<i  where 


Figure  3:  20-th  order  ARX  model  impulse  response  vs. 
true  model  based  on  standard  least  squares, 

W{z)  is  an  auto  regressive  finite- dimensional  transfer 
function  described  by 

r 

w{z)  =  r>(2)”S  D{z)  =  I  +  XI 

i  =  l 

Let  Ci  be  an  independently,  identically  distributed  se¬ 
quence  uncorrelated  with  Ut  .  Assume  that  r  < 
n  .  Then,  as  the  number  of  data  points  goes 
to  infinity,  the  first  n  -f  1  -  r  Markov  parameters 
Go,...,Gn-r  I  obtained  from  the  n-th  order  ARX 
model,  are  unbiased  estimates  of  the  true  Markov  pa¬ 
rameters  Go, . . . ,  Gn^r  • 

The  theorem  b  illustrated  by  figure  3.  The  figure 
shows  the  comparison  between  the  impulse  response 
of  a  20-th  order  ARX  model  (dashed  line)  estimated 
from  data  generated  by  a  4-th  order  model  (solid  line) 
observed  in  additive  white  noise.  The  order  r  of  the 
autoregressive  input  filter  was  chosen  to  be  6,  and  the 
signal  to  noise  ratio  was  equal  to  1.  Therefore,  the  im¬ 
pulse  responses  are  identical  up  to  the  n-hl—T*  =  15  -th 
Markov  parameter. 


Conclusions 

Despite  the  incompatibility  of  the  ARX  model  struc¬ 
ture  with  that  of  the  underlying  system,  it  is  possi¬ 
ble  to  use  least  squares  to  obtain  unbiased  model  es¬ 
timates.  This  can  be  done  in  two  ways  which  leads  to 
cither  n+l  unbiased  Markov  parameters  if  a  prcfiltcrcd 
white  input  is  used,  or  n  +  1  —  r  unbiased  Markov  pa¬ 
rameters  in  case  the  input  is  the  output  of  an  r-th  or¬ 
der  Auto  Regressive  filter  driven  by  white  noise.  The 
results  derived  in  this  paper  are  based  on  an  asymp¬ 
totic  analysis. 


Then,  we  may  postmultiply  (9)  by  the  (n  +  1)  x  (n  -f  1  -  r) 
block  matrix 

/  / 


dT 


[7]  H.P.  Zeiger  and  A.J.  McEwen,  Approximate  linear 
realization  of  given  dimension  via  Ho*s  algorithm^ 
lEEE-AC,  vol.  19,  p,  153,  1974. 


:  I 

Df  DT 

V  Dr  / 

This  leads  to 


\ 


V  c- } 

(10) 

In  the  all-pole  case,  we  have  Ci  =  . . .  =  Cr  =  0  . 
Then,  (10)  leads  to  Ao  =  ...  =  An-r  =  0  .  Since 
G{z)  =  A(z)Sttu-C>(z)  ,  by  str^ghtforward  multiplication 
of  coefEcients  we  obtain 


Gi  =  Gi  (i  =  0,...,n-T) 

which  proves  the  theorem.  From  (10)  it  is  evident  what 
the  problems  are  when  W{z)  has  zeros  as  well.  Then,  it  is 
not  straightforward  how  a  similar  result  can  be  obtained. 
We  leave  that  czise  as  a  direction  for  future  research. 
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Appendix 

Proof  of  theorem  1 

According  to  our  assumptions,  we  may  set  U£^  and  £U^ 
to  sero  in  expectation.  The  right  block  column  of  (4)  then 
leads  to 

/  gUU^J^  \ 

E{(i  A  -S  )  guu^i^  }  =  o  (5) 

V  lUU^J^  j 

By  definition  of  Markov  parameters  we  also  have 

/  gUU'^l'^  \ 

(  /  A  -B  )  gwi/’’!’'  =  0  (6) 

Let  us  define  the  (t,  y)-th  block  element  of  UU"^  as 
y-_i  = 

t 

Note  that  this  term  depends  on  j  i  only,  due  to  the  sta- 
tionarity  assumption.  Subtracting  (5)  from  (6),  taking  ex¬ 
pectations  and  introducing  Gk  for  the  difference  Gk  —  Gk  , 
we  get 


(' 


Gtv 

Gn+l 

Go 

Gi 

Vo 

..  Kn  ' 

V-n 

..  Vb 

y-n-i  . 

..  VLi 

Proof  of  theorem  3 

In  case  the  input  is  colored,  the  expectation  of  the  term 
will  not  be  a  block  diagonal  matrix  but  a  block 
Toeplitz  matrix.  Assume  that  the  input  spectral  density 
function  is  given  by 


V(z)  =  W(z)E«„lV(x)*  ,  W(z)  ^  1 +  Y^Wiz-' 


where  (W(x),  Ew)  represent  the  standard  innovations  rep¬ 
resentation  of  V'(z)  according  to  the  spectral  factorisation 
theorem  [l].  Then,  (7)  becomes 


If  ut  is  white  noise,  this  implies  that  Vk  =  •  Sub¬ 

stituting  that,  it  is  easily  seen  that  (7)  leads  to  Gk  =  0 
(k  =  0, .  . . ,  n)  which  proves  the  theorem. 


^  Go  ...  6n+l 

\  Go 

I  ...  IVn  1  ...  \ 


I  Wi 
I 


diag(E  vt*)* 


.  ...  I  =0 

*yj+i  ••• 

V  :  / 

or,  defining  A(2)  =  G(z)W(z)E„„  = 

/  Ao  ...  An  I  An+t 


(i 


Ao  Ai 


Proof  of  corollary  2  .  jyj  ...  i  = 

pp'T  ^  ^  yp-T 

Let  the  joint  plant /filter  transfer  function  be  defined  as  n+i  i 

r(z)  =  G{z)W{z)  ,  and  let  W{z)  have  no  seros  on  the  y  J  •  y 

complex  unit  circle.  Thus,  W{z)  has  a  stable  inverse  and 

G(z)  =  r(z)lV’(z)“*^  .  Subtracting  Cj[z)  =  f(z)W’(r)  ^  Now  suppose  W{^z)  =  P(z)“^C(z)  with 
from  this,  we  get  (G(z)  —  G(z))lV'(z)  =  t(z)  — r(z)  .  Since 

by  theorem  1  f*,*  =  r(i)  (t  =  0,..-,n)  ,  the  coefficients  of  r 

z®, . . . ,  z^^  of  the  Taylor  scries  expansion  of  this  term  are  Diz~^  ,  G(z)  =  J  +  Ciz“* 

zero.  Thus,  G»  =  Gi  for  t  =  0, . . . ,  n  . 


A. 4  On  some  key  issues  in  the  windsurfer  approach  to  adaptive  control 
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On  Some  Key  Issues  in  the  Windsurfer  Approach  to 
Adaptive  Robust  Control 

W.  S.  LEE,t  B.  D.  O.  ANDERSON,* * * §  I.  M.  Y.  MAREELS§  and  R.  L.  KOSUT' 

In  the  context  of  iterative  identification  and  control,  where  the  control 
objective  is  to  maximize  the  bandwidth  of  the  closed-loop  system  while 
achieving  step  tracking,  we  provide  a  methodology  that  achieves  this  goal 

for  stable  systems. 


Key  Words — Adaptive  control;  robust  control;  internal  model  control;  identification. 


Abstract — We  examine  a  number  of  crucial  questions  that 
arise  in  the  windsurfer  approach  to  adaptive  robust  control. 
Considerations  are  limited  to  the  case  where  the  plant  is 
stable  and  has  no  zeros  on  the  imaginary  axis.  The  key 
conclusion  is  that,  given  a  strictly  proper  stable  model  of  a 
strictly  proper  stable  plant,  we  can  improve  the  performance 
robustness  of  the  closed-loop  system  through  the  windsurfer 
approach  if  the  plant  and  the  existing  model  have  no 
unstable  zeros  within  the  designed  closed-loop  bandwidth 
and  if  the  deterioration  in  performance  robustness  caused  by 
increasing  the  closed-loop  bandwidth  results  in  a  sufficiently 
high  signal-to-noise  ratio  for  a  certain  closed-loop  output 
error.  Situations  that  may  cause  the  iterative  identification 
and  control  design  process  to  terminate  prematurely  are 
identified.  A  simulation  example  is  used  to  illustrate  the 
results  discussed. 


1.  INTRODUCTION 

1.1.  Background  and  objectives 
A  new  adaptive  control  paradigm  known  as  the 
windsurfer  approach  was  first  introduced  in 
Anderson  and  Kosut  (1991).  The  objective  of 
this  approach  is  to  increase  the  bandwidth  of  a 
closed-loop  system,  if  possible  to  a  specified 
value  through  an  iterative  identification  and 
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control  design  procedure,  given  that  the  initial 
model  of  the  plant  may  involve  significant  error 
in  the  high-frequency  region.  Furthermore,  as 
the  closed-loop  bandwidth  is  being  increased,  the 
closed-loop  frequency  response  is  to  be  kept 
approximately  flat  in  the  passband  so  that  the 
closed-loop  transient  response  is  not  too 
oscillatory  or  having  excessive  peak  overshoot. 

Iterative  identification  and  control  design  is  a 
topic  of  growing  interest  (see  e.g.  Anderson  and 
Kosut,  1991;  Zang  et  ai,  1991;  Schrama,  1992; 
Schrama  and  Van  den  Hof,  1992;  Lee  et  al, 
1993;  Partanen  and  Bitmead,  1993).  Although 
these  schemes  are  different  in  detail  and  have 
been  proposed  for  achieving  different  control 
objectives,  they  have  all  originated  from  the 
awareness  that,  in  any  model-based  control 
design  task,  the  model  serves  no  other  purpose 
than  that  of  designing  a  controller.  It  is  therefore 
not  surprising  that  the  identification  criterion 
adopted  in  each  of  these  schemes  is  determined 
by  the  respective  control  performance  criterion. 
Furthermore,  the  controllers  employed  in  each 
of  these  schemes  are  designed  on  the  basis  of 
models  (except  possibly  the  first  one)  identified 
from  data  obtained  under  closed-loop  condi¬ 
tions.  An  historical  perspective  on  as  well  as  a 
tutorial  introduction  to  the  joint  design  of 
identification  and  control  may  be  found  in 
Gevers  (1993). 

A  scheme  for  the  windsurfer  approach  was 
presented  in  Lee  et  ai  (1993).  It  was 
demonstrated  by  simulations  that  the  bandwidth 
of  a  closed-loop  system  can  be  increased  by  the 
iterative  applications  of  the  internal  model 
control  (IMC)  method  (see  Morari  and  Zafiriou, 
1989)  and  a  closed-loop  system  identification 
procedure  pioneered  by  Hansen  (1989). 
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In  this  paper  we  examine  a  number  of  crucial 
questions  that  arise  in  this  approach.  When  can 
one  redesign  the  controller  and  expand  the 
closed-loop  bandwidth,  without  re-identifying; 
When  should  one  re-identify?  What  does  one 
want  to  identify  in  the  re-identification  process? 
What  can  one  identify  in  the  re-identification 
process?  How  can  an  identified  model  be 
verified  against  the  desired  purpose?  Will 
re-identification  always  lead  to  improved  closed- 
loop  performance?  Attention  is  restricted  to 
strictly  proper  stable  plants  with  no  finite  zeros 
on  the  imaginary  axis.  Extensions  to  more 
general  situations  are  currently  under  investiga¬ 
tion  (see  Campi  et  ai,  1994). 

1.2.  Structure  of  the  paper 

In  Section  2  we  describe  the  IMC  design  state 
of  the  windsurfer  approach  and  show  that  it  is 
safe  to  increase  the  designed  closed-loop 
bandwidth  gradually  if  the  plant  is  stabilized  by 
the  existing  controller.  Section  2  also  introduces 
some  of  the  key  concepts  and  notation  used  in 
the  paper.  Properties  of  good  models  for  the 
windsurfer  approach  are  established  in  Section  3. 
The  control-relevant  system  identification 
method  employed  by  the  windsurfer  approach  is 
described  in  Section  4.  Conditions  necessary  for 
identifying  a  good  model  and  methods  for 
verifying  experimentally  that  an  identified  model 
is  suitable  for  the  desired  purpose  (or  otherwise) 
will  be  given.  In  Section  5  we  study  mechanisms 
that  may  influence  the  iterative  identification  and 
control  design  of  the  windsurfer  approach. 
Situations  that  may  lead  to  the  premature 
termination  of  the  iterative  process  will  be 
indicated.  In  Section  6  two  methods  for  model 
validation  are  described.  A  procedure  for  the 
identification  of  a  better  model  while  avoiding 
the  potential  danger  of  causing  instability  in  the 
actual  closed-loop  system  will  then  be  suggested. 
Conditions  under  which  the  performance  robust¬ 
ness  of  a  closed-loop  system  can  be  improved 
through  the  windsurfer  approach  are  discussed 
in  Section  7.  A  simulation  example  is  presented 
in  Section  8.  We  conclude  the  paper  in  Section  9. 

2.  PRELIMINARIES 

In  this  section  we  describe  a  closed-loop 
system  where,  on  the  basis  of  a  strictly  proper 
stable  model,  a  sequence  of  controllers  is 
designed  for  a  strictly  proper  stable  plant.  This 
description  also  introduces  some  of  the  key 
concepts  and  notation  used  in  this  paper.  In 
particular.  Section  2.1  outlines  the  IMC  method 
in  the  manner  that  it  is  applied  in  the  control 
design  step  of  the  windsurfer  approach.  The 


concepts  of  nominal  performance  and  robust 
stability  are  then  introduced.  In  Section  2.2  it  is 
shown  that  we  can  increase  the  designed 
closed-loop  bandwidth  of  the  system,  while 
maintaining  the  stability  of  the  actual  closed- 
loop  system,  if  the  increment  is  sufficiently  small. 
We  conclude  this  section  with  a  definition  of 
performance  robustness  relevant  to  the  windsur¬ 
fer  approach. 


2.1.  Controller  design  in  the  windsurfer  approach 

The  IMC  method  is  applied  in  the  control 
design  step  of  the  windsurfer  approach  where 
the  reference  input  is  a  step  function.  Although 
the  IMC  method  is  generally  applicable  to  the 
case  where  the  plant  and  the  models  are  not 
necessarily  stable,  we  restrict  ourselves  to  the 
case  where  the  plant  and  the  models  are  strictly 
proper  and  stable.  In  this  situation  the  designed 
closed-loop  bandwidth  of  the  system  is  deter¬ 
mined  by  a  single  design  parameter. 

Consider  a  closed-loop  system  as  shown  in  Fig. 
1,  where  G  is  the  transfer  function  of  a  strictly 
proper  stable  plant.  A  sequence  of  such  models 
(identified  from  data  obtained  in  closed-loop) 
eventuates  in  the  windsurfer  approach.  We  use 
Gi  to  denote  the  iih  member  in  the  sequence  of 
strictly  proper  stable  models  {Go,  Gi,  G2, . . .}. 
On  the  basis  of  G/,  a  finite  sequence  of 
controllers  {/C®, /C], . . . , /C{}  is  designed  such 
that,  while  keeping  the  closed-loop  frequency 
responses  approximately  fiat  within  the  pass 
bands,  the  corresponding  closed-loop  band- 
widths  form  an  increasing  sequence 
{A?,  AJ,  . . , ,  A{}.  Note  that  we  shall  in  general 
use  Ki  to  denote  one  of  the  controllers  in  the 
sequence  {K^,  K], ,  K^i\  when  it  is  immaterial 
to  the  discussion  which  particular  controller  is 
involved.  Figure  1  shows  that 


Qi 

l-QQi' 


with  (2/  defined  in  the  IMC  method  by 


(1) 

(2) 


Fig.  I.  Internal  model  control  structure. 
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where  [G,]„,  is  the  minimum-phase  factor  of  G„ 
and 


is  a  suitable  IMC  filter  when  the  model  is  stable 
and  when  the  reference  input  is  a  step  function. 
The  integer  n  of  the  IMC  filter  is  chosen  such 
that  Qi  is  proper.  The  design  parameter  A,  must 
be  chosen  such  that  the  actual  closed-loop 
transfer  function 


TK, 

1  +  GKi 


(4) 


is  stable. 

It  can  be  shown  that  the  designed  closed-loop 
transfer  function 


f^_G£L_ 

'  1  +  GiKi 

can  be  written  as 

fi  =  G.Q,  (5) 

or 

(6) 

where  [G/]a  is  the  all-pass  factor  associated  with 
Gr 


Remarks. 

•  Since  [G/]a(ja))  does  not  affect  the  magnitude 
of  fi{](o),  it  is  clear  that  |7;(ja))|  is  flat  in  its 
passband  and  A/  is  the  designed  closed-loop 
bandwidth  with  an  attenuation  of  -3n  dB. 

•  Note  that  the  system  becomes  open-loop  when 
A/  approaches  zero.  Since  G  is  stable,  it  is 
always  possible  to  make  Ti  stable  by  choosing 
a  sufficiently  small  A/. 

Although  the  designed  closed-loop  transfer 
function  7)  is  always  well  behaved,  the  actual 
closed-loop  transfer  function  7]  may  become 
unstable  when  A/  is  too  large.  We  introduce  the 
following  definitions. 


Definition  1.  The  designed  closed-loop  system 
involving  TC,  has  robust  stability  if  the  stability  of 
%  implies  the  stability  of  7;.  Ki  is  said  to 
robustly  stabilize  G/. 


Definition  2.  For  any  two  closed-loop  systems 
designed  by  the  method  described  in  this 


subsection,  we  say  that  the  one  with  a  larger 
value  of  A/  has  a  better  nominal  performance. 


2,2.  Improving  nominal  performance  while 
maintaining  stability 

In  Section  2.1  we  have  indicated  that  a 
closed-loop  system  may  not  have  robust  stability 
when  A/  becomes  too  large.  Since  the  objective 
of  the  windsurfer  approach  is  to  increase  the 
closed-loop  bandwidth  to  a  specificed  value,  the 
following  question  appears  naturally: 

•  When  can  the  closed-loop  bandwidth  be 
increased  with  safety;  that  is,  without  losing 
robust  stability,  while  retaining  the  use  of  the 
model  Gil 

To  answer  the  above  question,  we  recall  from 
Lee  et  al  (1993)  that  if  7;  (corresponding  to  A,)  is 
stable  then  there  exists  a  strictly  proper  transfer 
function  such  that 


It  can  then  be  shown  that 

7;  =  Q/(1“  W,  (7) 

where  7)  =  7;  -  7)  is  the  error  in  the  closed-loop 
transfer  function  induced  by  the  error  in  the 
model  G,  when  the  designed  closed-loop 
bandwidth  is  A,. 

Suppose  that  the  designed  closed-loop  band¬ 
width  is  increased  to  A/  >  A,;  then,  corresponding 
to  A/,  we  can  write 

fl  =  0/(1  -  Ti)Rl, 

where 

, _ ^ _ 

Since  0/  and  f/  are  stable  by  design,  f/  and  Tl 
are  stable  if  and  only  if  Rl  is  stable.  However,  R] 
is  stable  if  A-  -  A,  >  0  is  sufficiently  small.  Hence 
we  have  the  following  conclusion. 


Conclusion  1.  We  can  increase  the  designed 
closed-loop  bandwidth  cautiously  if  the  existing 
closed-loop  system  has  robust  stability. 

Remark.  Note  that  even  when  7]  is  stable,  its 
response  to  the  reference  input  could  be 
significantly  different  from  that  of  if,  relative 
to  the  frequencies  where  G,  has  significant 
errors.  A,  is  not  sufficiently  small.  To  address  this 
issue,  we  now  introduce  the  concept  of  robust 
performance  that  is  relevant  to  the  windsurfer 
approach. 


Definition  3.  With  respect  to  the  given  reference 
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input  r  and  a  specified  finite  (usually  suitably 
small)  cr>0,  the  closed-loop  system  is  said  to 
have  robust  performance  with  designed  closed- 
loop  bandwidth  A,  if  and  only  if 

where  u,  =  T^r  is  the  tracking  error. 

Remarks. 

•  Robust  stability  is  necessary  for  robust 
performance,  but  it  is  not  sufficient. 

•  It  is  important  to  note  that  a  closed-loop 
system  may  have  high  nominal  performance 
(large  A,)  but  poor  robust  performance 
(7/  >  cr),  and  vice  versa. 

•  For  a  model  with  significant  modelling  errors 
in  the  high-frequency  region,  the  closed-loop 
system  can  be  designed  to  have  good  robust 
performance  if  the  designed  closed-loop 
bandwidth  is  sufficiently  small. 

•  While  A,  is  being  increased,  a  stage  can  be 
reached  (before  the  occurrence  of  instability) 
where,  because  of  the  modelling  errors 
associated  with  G,  making  a  significant 
contribution  to  7,  the  performance  robustness 
has  deteriorated  beyond  an  acceptable  level. 
At  this  stage  the  designed  closed-loop 
bandwidth  is  A,  =  A{,  and  we  decide  to  obtain  a 
more  accurate  model  before  continuing 
to  open  up  the  bandwidth. 

3.  PROPERTIES  OF  GOOD  MODELS 
In  Section  2,2  we  have  concluded  that  when 
performance  robustness  of  the  closed-loop 
system  has  deteriorated  beyond  an  acceptable 
level,  it  is  necessary  to  identify  a  model  better 
than  the  existing  one  before  the  designed 
closed-loop  bandwidth  can  be  increased  further. 

It  is  clear  from  Section  2.2  that  we  can 
increase  the  designed  closed-loop  bandwidth  as 
long  as  the  closed-loop  system  has  robust 
performance.  Therefore  it  is  natural  that,  when 
the  closed-loop  system  looses  robust  perfor¬ 
mance,  we  attept  to  seek  a  new  model  that  will 
allow  robust  performance  of  the  closed-loop 
system  to  be  restored  through  controller 
redesign  (while  the  designed  closed-loop  band¬ 
width  remains  unchanged).  This  prompts  us  to 
ask  the  following  question. 

•  What  would  we  tike  to  identify,  in  order  that, 
with  the  new  model,  robust  performance  of 
the  closed-loop  system  can  be  improved 
through  controller  redesign? 

Before  we  proceed  to  answer  the  last  question, 
we  should  observe  that  each  stage  of  the 
windsurfer  approach  involves  an  existing  model 


G/  and  an  updated  model  G/+i.  Since  every  stage 
of  the  iteration  proceeds  in  a  similar  fashion,  it 
suffices  to  discuss  only  the  stage  where  i  =  0. 
Therefore  we  shall  denote  the  existing  model  by 
Go  and  the  updated  model  by  G,.  This  system  of 
notation  will  carry  over  to  all  transfer  functions 
and  signals  involved  in  the  following  discussions. 

Suppose  that  Gi  is  identified  when  Aq  has 
reached  A{.  A  new  controller  will  then  be 
designed  on  the  basis  of  Gj  such  that  A?  has  the 
same  value  as  A^.  Obviously,  we  should  like 
y?=||f?rl|2  to  be  small.  By  using  (l)-(5), 
with  appropriate  adjustments  made  to  the 
notation,  we  can  write  T?  =  T?  -  T?  as 


r?= — — (i-f?).  (8) 

Gi 

Clearly,  it  is  necessary  that  f?  be  stable.  Since 
G  -  G|  is  unknown,  we  conclude  the  following. 


Conclusion  2.  We  would  like  to  identify  G  via  a 
G,  of  sufficient  accuracy  such  that  the  model  Gi 
satisfies  the  sufficient  condition  of  robust  stability 


<1. 


Furthermore,  we  observe  that  the  magjiitude  of 
the  designed  sensitivity  function  I  -  in  the 
right  hand  side  of  (8)  could  approach  a 
magnitude  significantly  greater  than  one  if  Gj 
has  unstable  zeros  within  the  passband  of 
7’?  =  F?[G,]a.  In  order  that  f?  have  a  small 
magnitude,  we  require  in  addition  to  the  above 
robust  stability  condition  the  following. 


Conclusion  3.  We  would  like  to  identify  G  via  a 
Gi  of  sufficient  accuracy  such  that 


G(jfa>)  -  Gi(j(u) 
G,(ja)) 


is  sufficiently  small  for  all  frequencies  above  the 
lesser  of  the  passband  of  T?  and  the  smallest 
critical  frequency  corresponding  to  the  unstable 
zeros  of  T?. 


Remarks. 

•  Observe  that  the  unstable  zeros  of  are 
those  of  Gi,  which,  in  a  situation  with  good 
identification,  will  be  those  of  the  plant  G. 

•  If  Gi  has  unstable  zeros  located  within  the 
passband  of  V],  it  is  likely  that  there  is  a  range 
of  frequency  within  the  passband  of  f?  where 
the  magnitude  of  the  designed  sensitivity 
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function  1  is  significantly  greater  than 

one.  This  has  the  following  consequences: 

(1)  there  is  a  range  of  frequency  within  the 
passband  of  T?  where  the  designed  system 
has  poor  disturbance  rejection  and  the 
measurement  noise  is  not  well  attenuated; 


4.1.  Control-relevant  system  identification 

It  was  indicated  at  the  end  of  Section  2.2  that 
when  the  designed  closed-loop  bandwidth  has 
reached  a  certain  value  denoted  by  the  robust 
performance  measure 

Ji=\\vi\\l 


(2)  since  the  magnitude  of  the  designed 
sensitivity  function  is  the  inverse  of  the 
distance  of  the  open-loop  frequency 
response  curve  from  the  critical  point  of 
stability  at  -1+jO,  the  designed  system 
may  have  poor  stability  margins  and 
transient  response  if  the  magnitude  of  the 
designed  sensitivity  function  is  excessively 
large  near  the  edge  of  the  system 
passband. 

For  these  reasons,  we  may  not  want  to 
increase  the  designed  closed-loop  bandwidth 
Ai  beyond  A{  if  Gi  is  found  to  have  unstable 
zeros  with  critical  frequencies  within  the 
passband  of  T{. 

4.  SYSTEM  IDENTIFICATION  IN  THE 
WINDSURFER  APPROACH 

Notwithstanding  the  fact  that  we  have 
established  in  the  last  section  properties  of  a 
good  model  for  the  windsurfer  approach,  it  is 
important  to  ask  the  following  question. 

•  What  can  we  identify  by  using  the  system 
identfication  procedure  embedded  in  the 
windsurfer  approach? 


associated  with  the  closed-loop  system  designed 
on  the  basis  of  Go  would  become  excessively 
large.  It  was  shown  in  Section  3  that  at  this  stage, 
we  like  to  identify  a  new  model  G,  such  that 


G(ja>)  -  G^{]o)) 

Gi(jct>) 


is  sufficiently  small  in  an  appropriate  frequency 
range.  Unfortunately  it  is  not  clear  how  to 
process  input-output  measurements  to  deter¬ 
mine  Gi  so  that  this  condition  is  naturally  or 
automatically  satisfied.  To  overcome  this 
difficulty,  we  use  input-output  measurements 
and  possibly  the  reference  input  of  the  stable 
closed-loop  system  as  shown  in  Fig.  2  to  identify 
Gi  such  that 


GK{, 

CM  \  ^ 

\1  +  GKi 

1  +  G,/Cy  : 

is  minimized.  This  closed-loop  identification 
problem  can  be  transformed  into  an  open-loop 
identification  problem  by  employing  Hansen’s 
(1989)  framework  of  identification.  We  state  this 
result  in  the  following  theorem.  It  is  a  special 
case  of  Theorem  2  in  Lee  et  al  (1993)  when  the 
plant  G  and  the  model  Go  are  stable. 


In  this  section  we  answer  this  question  in  three 
steps.  In  Section  4.1  we  show  that  consideration 
of  the  control  objective  leads  to  a  (closed-loop) 
control-relevant  system  identification  problem 
that  can  be  transformed  into  an  open-loop 
system  identification  problem.  This  transforma¬ 
tion  is  achieved  by  employing  an  identification 
framework  pioneered  by  Hansen  (1989),  where, 
instead  of  the  plant  itself,  a  strictly  proper  stable 
transfer  function  (to  be  denoted  by  jR^)  that 
parametrizes  the  plant  is  identified.  In  Section 
4.2  we  shall  show  that  it  is  possible  to  identify  Ri 
accurately  only  if  the  signal-to-noise  ratio  of  a 
certain  closed-loop  output  error  resulting  from 
the  existing  controller  is  high.  Furthermore,  by 
recognizing  the  relation  between  the  signal 
component  of  the  closed-loop  output  error  and 
deterioration  in  robust  performance,  we  can 
restate  the  conditions  necessary  for  obtaining  an 
accurate  estimate  of  in  terms  of  the  level  of 
deterioration  in  robust  performance  against  the 
effect  of  noise  disturbance.  In  Section  4.3  we 
show  how  to  verify  indirectly  that  an  estimate  of 
R{^  is  unbiased. 


Theorem  1.  Let  (1  -  GoeO'^Qo  stabilize 
G  and  Go,  where  Qo  is  a  proper  stable  transfer 
function,  so  that  G  can  be  parametrized  by  a 
strictly  proper  stable  transfer  function  via 


Let 


G  =  Go  + 


Gi 


=  Go  + 


1- Wo 


(9) 


Fig.  2.  Closed-loop  system  just  before  identification. 


1624 


W.  S.  Lee  et  ai 


be  another  model  stabilized  by  where  is  a 
strictly  proper  stable  estimate  of  Rq,  Also  define 

^,  =  (l-n)(i3-Ma),  (10) 

where  a  =  Q^,  /3  =  -  GoMqi  and  are 

respectively  the  input  and  output  of  the  plant 
resulting  from  the  application  of  K{,.  Then  can 
be  expressed  as 


gM  \ 

1  +  G,k0 


r  +  wi, 


where 

wi=(l-Ti)He 


(11) 

(12) 


is  the  effect  of  the  noise  disturbance  e  on  the 
actual  closed-loop  output. 


Remarks. 

•  If  we  define  H  =  {\-  R{)Q{))~^S^,  where  is  a 
proper  stable  and  inversely  stable  transfer 
function,  then  the  actual  closed-loop  system 
has  Hansen’s  open-loop  representation 

p  =  Ria  +  Sy.  (13) 


has  also  taken  the  data  filters  L  into 
consideration)  for  this  system  identification 
problem  is 

P  =  R{>a  +  ^ly, 

where 

a  =  L(1  -  fi)a, 

=  L(1  -  fO/3, 

•  Since  the  input’  a  in  (13)  (and  hence  a)  is 
independent  of  the  noise  disturbance  e, 
identifying  and  (or  equivalently  W^)  is 
an  open-loop  identification  problem. 

•  We  identify  /?^  and  using  a  prediction  error 
method  (see  Ljung,  1987)  as  shown  in  Fig.  3. 
Data  filters  L  (typically  low~pass)  are  usually 
employed  to  shape  the  bias-distribution  of  the 
estimates  (which  is  due  to  under-modelling) 
such  that  the  model  error  is  small  in  the 
appropriate  frequency  range. 

We  can  summarise  the  above  discussions  as 

follows. 


•  From  Theorem  1,  it  is  clear  that  minimizing 

GK{,  G,Ki  \ 

1  +  GK^  1  -t-  G.Kir  2 

with  respect  to  Gi  is  equivalent  to  minimizing 


Conclusion  4.  We  can  transform  the  closed  loop 
identification  of  G  into  an  open-loop  identifica¬ 
tion  problem  for 


G-Gq 

I  +  Q^G-GoY 


\\{\  -  GoQiX^  -  Hcr)\\l 

with  respect  to  provided  that  G|  is  updated 
according  to 

Note  that  the  appropriate  signal  model  (which 


4.2.  Accurate  identification  of  R{) 

In  the  following  we  show  that  the  problem  of 
identifying  R{^  accurately  can  be  solved 
effectively  (using  finitely  many  input-output 
measurements)  if  the  signal-to-noise  ratio  of  a 
certain  closed-loop  output  error  (to  be  defined 
immediately)  is  high.  In  particular,  the  normal¬ 
ized  variance  for  an  unbiased  estimate  of  Ri  is 
small  if  the  signal-to-noise  ratio  associated  with 
the  closed-loop  output  error  is  sufficiently  high. 

From  Fig.  4,  we  observe  that  the  closed-loop 


a 


Fig.  3.  Identification  of  /?(>  and  'Fq. 


Fig.  4.  Closed-loop  output  error. 
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output  error  is  defined  as  io  =  yo~^  T^r.  By 
substituting  the  expressions  for  a  and  p  into  (10) 
and  noting  that 


Qi 

1  -  GoQi 


(r  -  y^), 


we  can  obtain 

=  arg  min  Mi  -  pQii^  -  fiywl .  (14) 

P 

Now  we  can  use  the  fact  that  yi  =  T{f  +  (1  - 
r^)//e  to  write 

^i  =  vi  +  wi,  (15) 

where 

vi=fi/.  (16) 

Remark.  Note  that  the  tracking  error  vi  cannot 
be  measured  directly.  It  can  only  be  estimated 
from  the  closed-loop  output  error  ii. 


Wo,  under  the  condition  that  the  order  of  the 
discrete-time  model  for  Ai  (denoted  by  m)  and 
the  number  of  data  (denoted  by  M)  are  large 
and  the  ratio  m/M  is  small.  Since 

<!>„(, (w)  =  \QiQo>)[l  -  fiQ(o)]RiQw)f  «I>,(a)) 

is  the  power  spectral  density  of  vi,  we  can  write 
the  normalized  variance  of  Ai  as 

/[^{(jaj)-i?-^(j<D)f\  _  m  <l>H,f.(ft>) 

\|  Ri(j<o)  11/  M<J>4a») 

for  the  frequencies  where  /?i(jw)  5^  0. 

Remark.  For  a  finite  number  of  data,  the 
normalized  variance  of  Ai  can  be  small  only  in 
the  frequency  range  where  the  signal-to-noise 
ratio  associated  with  the  closed-loop  output 
error  is  sufiiciently  high. 


It  is  apparent  that  vi  is  the  signal  component 
in  ^i  that  carries  the  useful  information  about 
the  existing  modelling  errors  under  closed-loop 
condition,  and  wi  is  the  noise  component  in  ^i 
which  obstructs  the  determination  of  Ai. 
Therefore  we  can  draw  an  immediate  conclusion. 


We  now  summarise  the  above  discussion  as 
follows. 

Conclusion  6.  We  can  obtain  an  unbiased 
estimate  of  Ri  with  a  small  normalized  variance 
in  a  certain  frequency  range  a>i  ^  a>  ^  a»2  if 


Conclusion  5.  We  can  identify  Ri  successfully  if 
the  signal-to-noise  ratio  associated  with  the 
closed-loop  output  error  resulting  from  the 
existing  controller  Ki  is  high. 


(i)  the  model  set  used  in  the  estimation  of  Ri  is 
sufiiciently  general; 

(ii)  for  some  sufficiently  large  /x>0,  the 
following  condition  holds: 


We  next  show  that  the  normalized  variance 
for  an  unbiased  estimate  of  Ri  is  small  in  the 
frequency  range  where  the  signal-to-noise  ratio 
associated  with  the  closed-loop  output  error  is 
sufficiently  high. 

By  substituting  (15)  and  (16)  mto  (14)  and 
noting  from  (7)  that  Ti  =  0^(1  -  Ti)Ri,  we  can 
write 

Ai  =  arg  min  \\Qi(,l  -  fi){Ri  -  p)r  +  wi\\l. 

p 

In  practice,  we  use  sampled  input-output  data  to 
estimate  a  discrete-time  model  for  Ai  before 
converting  it  to  a  continuous-time  transfer 
function.  We  assume  that  the  errors  involved  in 
this  conversion  are  negligible.  Following  Ljung 
(1987),  we  write  the  variance  of  an  unbiased 
estimate  of  Ri  approximately  as 

E(|^{(ja>)  - /?{(ja))p) 

_ 

M\Qi{j<o)[\-fi{]<o)f 

where  is  the  power  spectral  density  of 


— - - for  0)1  ^  ^ 

It  is  clear  that  nothing  comes  for  free,  and  it  is 
prudent  to  ask  the  following  question. 

•  What  is  the  price  that  we  have  to  pay,  in  terms 
of  system  performance,  before  a  sufficiently 
high  signal-to-noise  ratio  of  the  closed-loop 
output  error  can  be  achieved? 

We  next  show  that  it  is  necessary  to  have  a 
certain  level  of  deterioration  in  robust  perfor¬ 
mance  (relative  to  the  effect  of  noise  distur¬ 
bance)  before  the  closed-loop  output  error  can 
achieve  a  sufficiently  high  signal-to-noise  ratio. 
By  using  (12)  and  (16),  we  deduce  that 

^  for  0)1  o)  ^  0)2,  0)1^  0, 

if  and  only  if 

l^{)(ja»)l'  >  M  I[1  -  THja>)]HQw)\-  4>,(o)) 

for  0)]  s  0)  ^  0)2  (i7) 
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where  is  the  power  spectral  density  of  the 

noise  disturbance  e.  Upon  integration,  we  get 


d>^(a>)  d(i) 


>-  pKl  - 

TT  -'<^1 


By  Parseval’s  theorem,  and  noting  that  wi^O, 
we  can  write 


•^o>~  f  |fo(j‘^)P^r(‘^)  dw. 

Therefore 

I  'l[l-n(ico)]H(j<of<P,(co)dw. 

We  can  now  restate  the  conditions  necessary  for 
the  estimation  of  as  follows. 


Conclusion  7.  We  can  obtain  an  unbiased 
estimate  of  with  a  small  asymptotic 
normalized  variance  in  a  certain  frequency  range 
of  interest  (a>i  <  <  (02;  o>i  >  0)  if 

(i)  the  model  set  used  in  the  estimate  of  is 
sufficiently  general; 


is  a  practically  unbiased  estimate  (or  an 
unfalsified  model  as  discussed  in  Ljung  et  ai, 
1991)  of  R{).  In  this  subsection  we  show  how  to 
infer  that  Ri  is  a  practically  unbiased  estimate  of 
Ri  by  verifying  that  {I  +  GiK^y^G^Ki  is  a 
practically  unbiased  estimate  of  (1  + 
GK{r'GK{y 
We  begin  by  considering 

/  GKj  G,K{,  \  1 

\i  +  G/c{,  i  +  g.a:^/  i  +  ga:{ 

Clearly,  if  (1  +  G^K{,)~'GiK{,  is  a  practically 
unbiased  estimate  of  (1  +  GK{)~'GK{^  then  the 
power  spectral  density  of  should  reflect  the 
effects  of  the  noise  disturbance  only.  We  can 
perform  this  verification  experimentally  after  G, 
is  obtained  (as  we  describe  in  Section  6).  Now 
recall  that,  if  it  is  necessary  to  update  the  model 
Go,  the  magnitude  of 


GKj 
\  +  GK{ 


GoK{ 

1  +  GoKi 


=  G^(l  -  %)R{ 


must  be  significant  in  a  certain  frequency  range 
[wi,(W2].  Therefore,  before  the  model  Go  is 
updated,  both  the_  magnitude  of  the  frequency 
weighting  0{(1  -  f{)  and  the  magnitude  of 
cannot  be  small  in  [w,,  <02].  Since  we  can  write 


GKj 
l  +  GKi 


G,K{, 

l  +  GM 


=Qi{i-n)iRi~Rii 


(ii)  there  is  robust  performance  deterioration 
bounded  below  by 

-  f  'l[l  -  Po(jw)]H(ico)\^%i(o)dw 

^  Jo., 


it  can  easily  be  deduced  that  if  (1  + 
GiKi)~^GyK{)  is  a  practically  unbiased  estimate 
of  (I  +  GK{i)~^GK{)  in  [cui,a>2]  then  R{i  is  a 
practically  unbiased  estimate  of  Ri  in  [<01,0)2]^ 
We  can  therefore  conclude  the  following. 


for  a  sufficiently  large  /x  >  0. 


Remark.  It  is  obvious  that  a  problem  is  ilLposed 
if  the  value  of  a  that  specifies  the  tolerable  level 
of  deterioration  in  robust  performance  (see 
Definition  3)  does  not  satisfy  the  inequality 


1  cr 

71 a 


for  a  sufficiently  large  >  0. 


4.3.  Practically  unbiased  estimation  of  Ri 

In  Section  4.2  we  have  shown  that  the 
normalized  variance  of  can  be  small  if  the 
signal-to-noise  ratio  associated  with  the  closed- 
loop  output  error  is  sufficiently  high.  However 
normalized  variance  can  be  used  as  a  measure  of 
the  quality  of  an  estimate  only  if  the  estimate  is 
unbiased.  It  is  therefore  necessary  to  verify  that 


Conclusion  8.  We  can  verify  that  is  a 
practically  unbiased  estimate  of  in  [a>i,  0^2]  by 
verifying  experimentally  that  (1  + 
is  a  practically  unbiased  estimate  of  (1  + 
G/Cf))-'G/C{>in  [a),.(02l 


5.  MECHANISMS  THAT  INFLUENCE 
PERFORMANCE  ROBUSTNESS  AND 
IDENTIFICATION 

In  this  section  we  study  mechanisms  that 
influence  performance  robustness  of  systems 
designed  by  the  IMC  method.  We  show  that 
there  are  three  mechanisms  that  may  lead  to 
deterioration  in  robust  performance.  However, 
only  one  of  them  contributes  to  the  high 
signal-to-noise  ratio  needed  for  a  successful 
estimation  of  /?(),  These  observations  allow  us  to 
deduce  situations  where  the  iterative  identifica¬ 
tion  and  control  design  process  may  continue  or 
may  terminate  prematurely. 
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Recall  that  in  Section  4.2  we  have  shown  that 
a  certain  level  of  deterioration  in  robust 
performance  is  necessary  before  we  can  attempt 
to  find  a  good  estimate  of  Ri.  However,  we 
should  ask  the  following  question. 


•  Does  it  mean  that,  irrespective  of  the  causes, 
deterioration  in  robust  performance  is  always 
helpful  to  the  identification  of  /?{? 


The  answer  is  obviously  no. 

With  appropriate  substitutions  in  (16)  and  (12) 
respectively,  we  can  obtain 


Go 

Do 


(1  -  fiy. 


(18) 


wi- 


1  +  ■ 


■°“n 


(1  -  T{,)He.  (19) 


Go 


Since  Ji=  |li^||i  we  observe  that,  disregarding 
changes  in  disturbance  suppression  ability, 
deterioration  in  robust  performances  is  governed 
by  the  value  of 


We  therefore  conclude  from  the  right-hand  side 
of  (18)  that  for  a  given  reference  input,  there  are 
three  factors  that  contribute  to  Ji  through 

1.  The  effect  of  the  term  [(G  -  Go)/Go]fo  in  the 
numerator  is  independent  of  the  phase  angle 
of  [(C  -  Go)/Go]T^.  We  call  this  the  phase- 
insensitive  factor. 

2.  The  effect  of  the  term  1  +  [(G  -  Go)/Go]ro  in 
the  denominator  depends  on  tl^  gain  and 
phase  margins  of  [(G  -  Go)/Go]T{.  We  call 
this  the  stability  margin  factor. 

3.  The  effect  of  the  term  I  -  fi  depends  on  the 
existence  of  unstable  zeros  of  Go  within  the 
passband  of  T{)  =  F{)[Gq]^.  We  call  this  the 
unstable -zeros -dependent  factor. 


By  using  (18)  and  (19),  we  can  write  the 
signal-to-noise  ratio  associated  with  the  closed- 
loop  output  error  as 


GQd))  —  Go(ja)) 
Gopco) 

m<of 


fiQoj) 


This  equation  indicates  that  for  a  given  reference 
input  and  noise  disturbance  scenario,  only  an 
increase  in  the  magnitude  of  the  phase 


insensitive  factor  can  increase  the  signal-to-noise 

ratio  of  the  closed-loop  output  error.  We  now 

summarise  the  above  discussions  as  follows. 

Conclusion  9. 

1.  There  are  three  factors  that  can  cause  the 
performance  robustness  to  deteriorate:  the 
phase-insensitive  factor,  the  stability  margin 
factor  and  the  unstable-zeros-dependent  fac¬ 
tor.  Among  these,  only  the  phase-insensitive 
factor  can  contribute  to  improving  the 
signal-to-noise  ratio  associated  with  the 
closed-loop  output  error. 

2.  When  the  unstable-zeros-dependent  factor  or 
the  stability  margin  factor  are  the  main  causes 
of  deterioration  in  robust  performance,  the 
signal-to-noise  ratio  associated  with  the 
closed-loop  output  error  may  be  poor,  and  it 
may  be  difficult  to  obtain  a  practically 
unbiased  estimate  of  Ri  with  a  small 
asymptotic  normalized  variance.  This  may 
cause  subsequent  difficulties  in  continuing  the 
iterative  identfication  and  control  design 
process. 

Remarks. 

•  From  (18),  it  is  clear  that  d>„/(co)  cannot  be 
large  in  the  frequency  range  where  the 
designed  sensitivity  function  has  small  mag¬ 
nitude.  This  implies  that  the  frequency  range 
[(01,0)2]  emphasized  in  Section  4  cannot  be 
well  below  A{. 

•  From  the  definitions  of  the  phase-insensitive 
factor  and  the  stability  margin  factor,  we  can 
deduce  that  it  is  possible  to  estimate  R^ 
accurately  only  in  the  frequency  range  where 
the  designed  complementary-sensitivity  func¬ 
tion  weighted  multiplicative  modelling  error 
has  large  magnitude  and  small  phase  lag.  This 
implies  that  the  frequency  range  [w  1,0)2] 
cannot  be  well  above  (where  7^  has  small 
magnitude  and  large  phase  lag). 

•  When  the  stability  margin  factor  or  the 
unstable-zeros-dependent  factor  are  the  main 
causes  of  deterioration  in  robust  performance, 
it  may  be  difficult  to  obtain  an  accurate 
estimate  of  This  may  lead  to  premature 
termination  of  the  iterative  identification  and 
control  design  process.  In  particular,  when  the 
existing  model  Go  has  unstable  zeros  within 
the  passband  of  the  designed  closed-loop 
transfer  function  7{,  the  designed  sensitivity 
(unstable-zeros-dependent  factor)  may  have 
large  magnitude  in  a  certain  frequency  region. 
This  fundamental  limit  in  control  performance 
(discussed  in  Freudenberg  and  Looze,  1985) 
causes  a  deterioration  in  designed  and  robust 
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performances  with  no  improvement  in  the 
signal-to-noise  ratio  associated  with  the 
closed-loop  output  error. 


6.  IDENTIFICATION  AND  VALIDATION  OF  NEW 
MODELS 

In  Section  4.2  we  have  shown  that  under  noisy 
conditions,  the  accuracy  of  the  identified  model 
can  be  improved  by  increasing  the  signal-to- 
noise  ratio  associated  with  the  closed-loop 
output  error.  It  was  also  shown  that  this  is 
equivalent  to  having  a  certain  level  of  deteriora¬ 
tion  in  robust  performance  relative  to  the  effect 
of  noise  disturbance.  It  is  clearly  undesirable 
from  the  control  point  of  view  for  robust 
performance  to  deteriorate  too  seriously,  while 
on  the  other  hand  it  is  necessary  to  have  a 
sufiiciently  high  signal-to-noise  ratio  in  the 
closed-loop  output  error  before  identification 
can  successfully  be  carried  out.  Furthermore,  it  is 
important  to  ensure  that  a  model  with  the  right 
properties  is  identified.  We  should  therefore  like 
to  ask  the  following  practical  questions. 

1.  When  should  we  try  to  identify  a  better 
model? 

2.  Have  we  actually  identified  a  good  model  for 
our  purpose? 

Before  we  can  answer  these  questions,  we 
need  methods  for  validating  an  identified  model. 
In  Section  6.1  we  describe  a  frequency-domain 
method  for  model  validation.  In  Section  6.2  we 
give  a  time-domain  method  for  model  validation. 
In  Section  6.3  we  draw  on  the  results  of  Lee  et 
ai  (1994),  which  compared  the  two  methods  of 
model  validation  and  suggest  a  procedure  for 
identifying  a  better  model. 

6.1.  A  frequency -domain  method  for  model 
validation 

In  the  following  we  present  a  model  validation 
method  in  the  frequency  domain.  It  should  be 
emphasized  that  the  model  validation  procedure 
is  designed  with  the  closed-loop  control  objective 
in  mind. 

Recall  that,  given  the  existing  model  Gq,  it  is 
necessary  to  identify  an  improved  model  Gi 
when  llvolll  is  excessively  large.  Evidently 
d  could  be  large  (implying  undesirable 
performance)  with  one  or  both  of  and 
large.  If  the  former  is  larger,  there  is  a  potential 
to  reduce  it  by  improved  model  identification. 
But  this  only  works  (in  a  particular  frequency 
band  [a>i,a>2])  if  the  signal-to-noise  ratio  is 
sufficiently  high.  Specifically,  when  only  finitely 


many  input-output  measurements  are  available 
for  identifying  (which  parametrizes  G),  it  was 
shown  in  Section  4.2  that  the  normalized 
variance  of  is  small  only  if  the  signal-to-noise 
ratio  <I>„/(a>)/<t„,/(w)  associated  with  =  + 

is  sufficiently  high.  Obviously,  then  one 
needs  to  estimate  power  spectra  for  Wq  and  Vq 
(or  more  precisely  ^o)*  We  now  proceed  as 
follows. 

From  Vo  =  {Ti)-fo)r  and  =  +  we 

observe  that  when  r  =  0,  the  sole  contributor  to 
fo  w  H'o-  Therefore  we  can  compute  ^^.^(fu)  after 
measuring  fo  with  r  =  0.  When  r¥^0,  we  have 
^o-Vq-^  Wq.  Assuming  that  Vq  and  Wq  are 
uncorrelated  (which  follows  if  r  and  e  are 
uncorrelated — a  typical  situation),  0^^,(a))  = 
By  visual  comparison  of 
^^o(^)  we  evaluate  the  significance 

of  ^>,,,(a>)  with  respect  to  If  <l>^(,(w)  is 

significantly  larger  than  ^>h.(,(^)  in  a  frequency 
band  spanning  one  decade  and  centred  around 
(when  the  designed  closed-loop  bandwidth  is 
A^),  the  model  Go  is  invalidated  for  the  design  of 
closed-loop  systems  with  bandwidths  larger  than 
or  equal  to  A{. 

The  method  just  described  can  also  be  used  to 
validate  G]  after  it  has  been  identified  (both 
before  and  after  model  reduction  is  performed). 
We  simply  replace  Gq  by  Gi,  while  retaining 
in  the  simulation  of  the  designed  closed-loop 
response  to  the  reference  input.  This  allows  us  to 
compute  and  its  power  spectrum  0^,(cu).  By 
visually  comparing  with  ^>4(0)),  we  have 

good  confidence  that  Gi  is  a  reliable  model  of  G 
(when  the  designed  closed-loop  bandwidth  is  A{) 
if  is  comparable  to  <I>^v{,(ct>)  up  to  A{. 

6.2.  A  time-domain  method  for  model  validation 

We  now  describe  a  time-domain  model 
validation  method.  This  is  useful  both  for 
establishing  that  Gq  should  be  rejected  (that  is, 
as  a  flag  for  re-identification)  as  well  as  for 
validating  a  new  model,  Gi,  replacing  Gq. 

Referring  to  Fig.  3  and  (10),  we  notice  that 
e^  =  L^i  when  =  where  e^  is  the  prediction 
error  (also  known  as  the  residual).  We  also 
observe  from  (9)  that  Gi  =  Gq  when  ki  =  0,  and 
from  (11)  that  when  G,  =  Gq.  Therefore 

we  have  e^  =  Lf ^  when  =  1  and  Ri  =  0.  This 
suggests  that  Go  should  be  rejected  if  the 
cross-correlation  of  the  prediction  error  e^  with 
the  future  values  of  ‘input'  a  exceed  its  (3cr) 
confidence  limits  when  Wi  =  1  and  =  0.  This 
reasoning  is  independent  of  the  true  See 
Ljung  (1987)  for  more  details  of  model 
validation  by  correlation  techniques.  (Actually  it 
is  also  easy  to  apply  the  same  method  to 
validate  a  pair  of  newly  identified  Ri  and 
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before  is  used  to  calculate  Gj.  We  simply 
check  that  the  correlation  of  with  the  future 
values  of  a  are  within  their  respective  confidence 
intervals.) 

6.3.  Identification  of  a  better  model 
The  methods  of  model  validation  described  in 
Sections  6.1  and  6.2  were  compared  critically  in 
Lee  et  al  (1994).  The  key  conclusions  of  that 
study  are  summarized  in  the  following. 

•  Correlation  function  estimates  and  power 
spectrum  estimates  are  both  useful  for  model 
validation  where  the  goodness  of  fit  is  based 
on  a  closed-loop  control  criterion. 

•  Correlation  function  estimates  are  more 
sensitive  than  power  spectrum  estimates  in  the 
sense  that  the  former  tend  to  invalidate  a 
model  before  identifying  a  better  model  is 
necessary  and  possible.  This  does  not  imply 
that  the  correlation  method  is  useless.  On  the 
contrary,  it  suggests  that  the  correlation 
method  is  useful  for  detecting  incipient 
modelling  errors. 

•  Power  spectrum  estimates  not  only  suggest 
when  a  model  becomes  inadequate,  but  they 
also  indicate  the  frequency  range  in  which  the 
signal-to-noise  ratio  is  high  for  identification. 

•  There  is  a  limit  on  the  achievable  accuracy  for 

performing  identification  on  closed-loop  sys¬ 
tems  if  existing  controllers  are  designed  on  the 
basis  of  models  with  unstable  zeros.  (Recall 
the  effect  of  the  unstable-zeros-dependent 
factor  remarked  upon  at  the  end  of  Section  5.) 
Therefore  prior  knowledge  of  unstable  zeros 
in  the  existing  model  (say  Go)  is  important. 
Specifically,  let  o)^  be  the  minimum  critical 
frequency  corresponding  to  the  unstable  zeros 
of  Go;  simulation  experience  confirmed  that  it 
is  very  difficult,  if  not  impossible,  to  identify  a 
model  better  than  Go  if  It  should  be 

remarked  that  this  is  reminiscent  of  design 
tradeoffs  discussed  in  Freudenberg  and  Looze 
(1985),  as  opposed  to  an  ill-posed  problem. 

•  In  general,  we  update  Go  if 

(i)  both  methods  of  model  validation  suggest 
so;  and 

(ii) 

We  now  suggest  a  procedure  for  identifying  a 
better  model.  Notice  that  in  the  frequency  range 
where  the  current  model  Go  has  significant 
modelling  errors,  the  signal-to-noise  ratio  of  the 
closed-loop  output  error  can  be  increased  by 
increasing  the  magnitude  of  the  reference  input 
or  by  increasing  the  designed  closed-loop 
bandwidth.  If  practical  operation  constraints  do 
not  allow  the  magnitude  of  the  reference  input 


to  be  increased  then  the  signal-to-noise  ratio  of 
the  closed-loop  output  error  can  only  be 
increased  by  increasing  the  designed  closed-loop 
bandwidth.  This,  however,  has  the  potential 
danger  of  causing  instability  in  the  actual 
closed-loop  system  if  the  designed  closed-loop 
bandwidth  is  increased  excessively.  To  avoid  this 
danger,  we  proceed  as  follows. 

1.  Reduce  the  rate  of  increasing  the  designed 
closed-loop  bandwidth  Aq  once  the  correlation 
method  for  model  validation  has  invalidated 
Gq. 

2.  Attempt  to  identify  (when  Aq  =  A{)  as  soon 
as  the  power  spectrum  method  for  model 
validation  suggests  that  has  a  sufficiently 
high  signal-to-noise  ratio,  provided  that 
Aq  2^z* 

(a)  Use  the  collected  data  to  identify  a  set  of 
models  by  experimenting  with  the  likely 
model  structures.  Perform  model  verifi¬ 
cation  on  each  of  these  models. 

(b)  If  an  identified  model  is  found  to  be 
sufficiently  accurate,  accept  it  for  the  next 
stage  of  control  design.  Otherwise, 
increase  the  designed  closed-loop  band¬ 
width  slightly,  collect  a  new  set  of 
measurements  and  repeat  the  procedures 
of  model  estimation  and  verification. 

(c)  Repeat  the  last  two  steps  until  a 
sufficiently  accurate  model  is  obtained 
and  verified. 

3.  Terminate  the  iterative  identification  and 

control  design  procedure  if  and 

although  unacceptably  large,  does  not  facilit¬ 
ate  the  identfication  of  a  better  model. 

7.  ROBUST  PERFORMANCE  IMPROVEMENT 
Now  we  know  what  can  be  identified  and  how 
an  identified  model  can  be  validated.  We  have 
also  indicated  in  Section  3  what  we  would  like  to 
identify.  It  is  therefore  logical  to  ask  the 
following  question. 

•  How  does  the  object  which  we  can  identify 
relate  to  the  object  which  we  would  like  to 
identify? 

The  answer  is  that  the  objects  are  virtually  the 
same,  although  it  is  not  obvious.  What  we  can 
identify  is  couched  in  terms  of  /?{,  and  what  we 
would  like  to  identify  is  couched  in  terms  of  G]. 
We  need  to  connect  these  characterizations.  In 
this  section  we  show  that,  provided  that  certain 
conditions  are  satisfied,  the  controller  designed 
on  the  basis  of  the  model  Gi  updated  through  an 
estimate  of  Ri  can  improve  the  performance 
robustness  of  the  system. 

Recall  from  (17)  that,  just  before  we  attempt 
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to  update  the  model  Go  through  identifying  it 
is  necessary  that 

>  |[1  ~  TU](x))\H (jo))!^  for  0)1  <  a;  <  CJ2 


for  a  sufficiently  large  ^t>0.  Furthermore,  it  is 
also  necessary  that  ifiQa))]  in  the  above 
inequality  be  mainly  contributed  by  the  phase- 
insensitive  factor  before  an  accurate  estimate  of 
can  be  obtained.  This  implies  that  in  order  to 
improve  the  robust  performance  through  iden¬ 
tification  and  redesign,  it  is  necessary  that  the 
phase-insensitive  factor  (which  is  also  the 
complementary  sensitivity  weighted  multiplica¬ 
tive  factor  (which  is  also  the  complementary 
sensitivity  weighted  multiplicative  modelling 
error)  associated  with  the  updated  model  Gi  and 
the  redesigned  controller  (while  keeping 
A?  =  A^)  be  small  in  the  frequency  range 
[o)  1,0)2].  Hence  it  is  relevant  to  consider  the 
magnitude  of  the  ratio 


(C_^ 
V  G. 


G  Gq 
Go 


in  the  frequency  range  [o),,  0)2]. 

Before  the  main  results  are  presented  in 
Theorems  3  and  4,  we  state  a  theorem  (which 
follows  directly  from  Theorem  4  of  Lee  et  ai, 
1993)  that  is  relevant  to  the  choice  of  the 
relative  degree  of  and  establish  two  lemmas 
that  we  use  in  the  proof  of  Theorem  3. 


Theorem  2.  Let  the  controller  and  the  proper 
stable  transfer  function  Q{^  designed  by  the  IMC 
method  described  in  Section  2.1.  Then  the 
relative  degree  of 


°  l  +  Q{(G-Go) 


is  given  by 

rel  deg  {/?{}  =  min  (rel  deg  {G},  rel  deg  {Gq}). 

Remark,  The  relative  degree  of  the  strictly 
proper  plant  G  is  usually  unknown.  It  is 
therefore  necessary  to  allow,  in  the  identification 
of  the  relative  degree  of  R{^  to  take  the 
smallest  possible  value  of  one. 


then 


(i)  Gi  has  a  relative  degree  A:,  where 
k^n  xiq  —  riy 

k  =  min  (n,  q)  otherwise; 


(ii)  there  exists  a  strictly  proper  IMC  filter 


i  is  an  integer 


(where  Fi  has  a  relative  degree  such  that 
Qo  =  [Go]m'H  is  proper)  such  that  Q°  = 
[Gi]m’F?  has  at  least  the  relative  degree 
of  Ql; 

(iii)  if  Gi  has  no  zeros  along  the  imaginary  axis 
then  QVQi  is  bounded  along  the  imaginary 
axis;  in  particular,  there  exists  a  finite  8  such 
that 


=  8. 


Proof,  (i)  There  are  three  cases  to  be  con¬ 
sidered,  namely  q-n,q<n  and  q>n. 

For  the  case  where  q-n,  consider 


lim  ^‘^Gi  =  lim  ^"Go  +  lim 


\+qiR{>' 


Since 


lim5''Go  =  Ci,  C]?^0, 

S—*'X 


and 


we  have 


^2^0, 


lim  ^'^Gi  =0  if  C2  =  -Ci, 


or 


lim^'^Gt^O  \ic2^^C\. 


Hence  the  relative  degree  oi  \s  k>q  -  n. 
Next  consider 


Lemma  1.  Suppose  that  Gq  has  a  relative  degree 
of  n>  I,  Q{)  =  [Gojm^H  is  proper,  and  Ri  has  a 
relative  degree  of  q^l.  If  G,  is  updated 
according  to 


lim  s'^Gi  =  lim  s'^Gq  +  lim - . 

for  q  <n.  Clearly, 

lim  ^'^Gi  =  lim  0, 

5— »X 


G|  -  G„  + 
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since  is  proper  and  Go  has  a  relative  degree 
larger  than  q.  Therefore  the  relative  degree  of 
Gj  is  k  =  q  <n. 

Similar  considerations  applying  to 


lim  5'*Gi  =  lim  ^^Go  +  lim 


1  + 


for  q>n  will  lead  to  the  conclusion  that  the 
relative  degree  of  Gj  is  k  =  n<q. 

(ii)  It  follows  immediately  from  the  above 
that  2°  =  [G,]“*F?  has  at  least  the  relative 
degree  of  Qi  =  [Gojm'H  if 


Obviously,  since  k^  \  and  the  relative  degree  of 
F{)  is  at  least  n,  F?  is  strictly  proper. 

(iii)  Now  it  is  easy  to  conclude  that  for  the 
above  choice  of  F?,  Q?/ Qi  is  proper.  Therefore 
if  Gi  has  no  zeros  along  the  imaginary  axis  and 
Go  has  no  poles  along  the  imaginary  axis  then 
QVQo  is  bounded  along  the  imaginary  axis  and 
there  exists  a  finite  6  such  that 


sup 

60]  to  2 


g?ac^) 

G/Gco) 


=  5. 


□ 


Remarks, 

•  It  will  be  clear  from  Theorem  3  that  it  is 
undesirable  for  8  to  become  excessively  large. 

•  Since  all  poles  of  Go  that  are  also  poles  of  G 
are  always  retained  by  a  well-identified  Gi,  it 
is  clear  that  poles  of  Go  that  are  also  poles  of 
G,  even  if  they  are  near  the  imaginary  axis, 
will  not  cause  8  to  assume  an  excessively  large 
value. 

•  Zeros  of  Gi  near  the  imaginary  axis  for 
(s)^^o}^o)2  may  not  appear  as  zeros  of  Gq. 
However,  these  zeros  would  be  zeros  of  the 
plant  G  if  Gj  is  a  well-identified  model  of  G 
for  cjj  <  oj  <  W2.  This  would  happen  only  if  we 
increased  the  closed-loop  bandwidth  to  the 
frequency  range  where  the  plant  has  (stable 
or  unstable)  zeros  near  the  imaginary  axis,  and 
the  controller  has  excessively  large  gain. 
Therefore  we  can  prevent  8  from  being 
excessively  large  by  observing  well-known 
design  guidelines. 

•  If  Go  has  poles  near  to  the  imaginary  axis  for 

<  a>2  that  are  not  poles  of  the  plant  G 
then  a  well-identified  model  Gi  for  G  either 
will  have  no  poles  at  these  locations  or  will 
have  approximate  pole  zero  cancellations  at 
these  locations.  In  these  situations  8  may 
become  excessively  large.  It  is  therefore 
important  to  verify  that  an  identified  model 


(such  as  Go)  has  no  unnecessary  poles  near  the 
imaginary  axis. 


Lemma  2.  If  Gq  and  Gi  are  strictly  proper  stable 
models  of  the  plant  G,  and  =  Gogo  is  the 
closed-loop  transfer  function,  where  Q{y  is 
designed  by  the  IMC  method,  then  there  exists  a 
finite  7)  such  that 


sup 

6U I  :£  6l>  £  6i»2 


Gi(ja>)  GqG^) 
GoG^) 


=  '»?• 


Proof,  Clearly  the  transfer  function  G^  -  Gq  is 
stable.  Also,  from  the  facts  that  F{=  Gogo  and 
that  the  Qi  designed  by  the  IMC  method  is 
proper  and  stable,  it  is  easy  to  conclude  that 


U  =  (G,~-Go)Qi 


is  proper  and  stable. 


□ 


Remarks, 

•  It  will  be  clear  from  Theorem  3  that  it  is 
undesirable  for  77  to  become  excessively  large. 

•  7]  may  become  excessively  large  if  Gi  has  poles 
near  the  imaginary  axis  for  coi<co^cu2  that 
are  not  poles  of  Gq,  and  if  is  very  near  to 
the  critical  frequencies  of  these  poles. 
However,  this  is  impossible  because  if  Gi  were 
a  well-identified  model  of  the  plant  G  then  G 
would  have  poles  near  to  ±jA{  that  are  not 
poles  of  Go.  Under  these  conditions,  the  actual 
closed-loop  system  would  be  unstable  or 
almost  unstable.  Furthermore,  A^  cannot  be 
close  to  the  zeros  of  Go  near  the  imaginary 
axis  for  o>i  <  w  <  (02,  because  this  will  result  in 
a  controller  with  an  excessively  large  gain  in 
that  frequency  range.  Hence,  by  ensuring  that 
the  actual  closed-loop  system  is  far  from 
instability  (recall  the  guidelines  given  at  the 
end  of  Section  6.3)  and  by  observing 
well-known  controller  design  guidelines,  we 
automatically  prevent  17  from  taking  exces¬ 
sively  large  values. 


Theorem  3.  Let  Gq  be  a  stable  strictly  proper 
model  of  the  plant  G.  Suppose  that  G  is 
stabilized  by  the  controller  designed  accord¬ 
ing  to  the  IMC  method  described  in  Section  2.1, 
and  hence  has  the  description 


where 


G  -  Go  + 


Ri 


Qo  ~  [Gojm'^O- 


(20) 

(21) 


with  [Go]m  the  minimum-phase  factor  of  Go.  Let 
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1.  When  can  one  redesign  the  controller  and 
expand  the  closed-loop  bandwidth,  without 
re-identifying? 

2.  When  should  one  re-identify? 

3.  What  does  one  want  to  identify  in  the 
re-identification  procedure? 

4.  What  can  one  identify  in  the  re-identification 
procedure? 

In  order  to  check  if  an  identified  model  is 
actually  good  for  our  purpose,  we  have 
presented  two  methods  for  validating  an 
identified  model  experimentally  before  it  is 
employed  in  controller  redesign. 

The  main  conclusion  of  this  paper  is  that, 
given  a  strictly  proper  stable  model  of  a  strictly 
proper  stable  plant,  it  is  possible  to  improve  the 
robust  performance  of  a  closed-loop  system 
through  the  windsurfer  approach  if 

(i)  the  deterioration  in  performance  robustness 
caused  by  increasing  the  closed-loop  band¬ 
width  is  mainly  contributed  by  the  phase- 
insensitive  factor; 

(ii)  the  deterioration  in  performance  robustness 
caused  by  increasing  the  closed-loop  band¬ 
width  resulted  in  a  sufficiently  high  signal- 
to-noise  ratio  associated  with  the  closed- 
loop  output  error; 

(iii)  the  designed  closed-loop  bandwidth  has  not 
approached  the  minimum  critical  frequency 
corresponding  to  the  unstable  zeros  of  the 
plant  or  the  existing  model. 
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A  Family  of  Norms  for  System  Identification  Problems 
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Abstract— In  this  correspondence,  we  introduce  a  family  of  norms 
that  may  prove  useful  in  system  identification  problems.  The  important 
property  of  the  new  norm  is  that  for  a  given  sequence,  its  value  in  the  limit 
will  converge  to  the  supremum  over  all  frequencies  of  the  spectrum  of  the 
sequence.  Using  this  property,  a  procedure  is  outlined  to  approximately 
minimize  the  weighted  Coo  norm  of  the  frequency  response  estimation 
error.  In  addition,  a  test  for  checking  the  Whiteness  of  a  given  sequence 
IS  proposed. 


I.  Introduction 

The  parametric  approach  to  system  identification  is  based  on  select¬ 
ing  an  appropriate  model  structure  and  a  search  for  the  parameters 
of  the  model  that  best  describes  the  data.  Usually,  the  best  model 
within  the  model  set  is  characterized  as  the  one  that  minimizes  a 
selected  norm  of  the  prediction  errors.  By  far  the  most  popular  norm 
is  the  sum  of  the  square  of  the  prediction  errors — the  quadratic  norm. 
In  this  correspondence,  we  introduce  a  new  family  of  norms  that 
seem  to  be  useful  in  system  identification  problems.  The  new  norms 
have  interesting  interpretation  in  the  frequency  domain  and  include 
the  usual  quadratic  norm  as  special  case.  The  important  property  of 
the  new  norm  is  that  in  the  limit,  its  minimization  is  equivalent  to 
minimizing  the  supremum  over  all  frequencies  of  the  spectrum  of  the 
prediction  error  or  equivalently,  minimizing  its  C:^  norm. 
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II.  Definitions  and  Preliminaries 

Lei  us  assume  wc  are  given  a  scalar  bounded  sequence  {ej,i  = 
1,  -  •  • ,  /V}  which  in  our  application  represents  the  prediction  errors 
computed  from  the  observed  data  and  a  guessed  model  parameter 
vector  6.  Based  on  this  sequence,  form  the  (TV-f  Af  —  1)  x  Af  matrix 


Emm 


•/N 


Cl 

0 

0 

C2 

Cl 

0 

CM 

CM-l 

ei 

CM 

CM-l 

'  *  •  eyv-Af+i 

0 

CM 

•  •  •  eAr-M+2 

0 

0 

CM 

(1) 


is  a  first  attempt  to  whitening  the  prediction  error  in  addition  to 
minimizing  its  variance. 

Note  that  the  whiteness  of  the  prediction  error  is  an  important 
factor  in  the  validating  a  computed  model  [I],  this  desirable  property 
of  the  prediction  error  is  not  reflected,  however,  in  any  form  in  the 
usual  quadratic  norm.  But  Vm  is  not  only  a  function  of  the  variance 
of  the  prediction  error  but  is  also  a  function  of  the  values  of  the 
autocorrelation  of  the  prediction  error  for  time  shifts  up  to  Af  —  1, 
and  by  increasing  M  more  and  more  of  the  temporal  behavior  of  this 
autocorrelation  affects  Vm- 

III.  Frequency  Domain  Properties 

Now  we  discuss  the  frequency  domain  interpretation  of  the  new 
norm.  First  assume  the  limit  of  ai  defined  in  (2)  as  N  goes  to  infinity 
exists 


ltn^~.»oo  tii  ““  n.|. 


(7) 


where  we  assume  1  <  Af  <  iV.  Note  that  Emm  is  constant  along 
the  diagonals,  and  for  Af  =  1,  Emi  is  a  column  vector  with  eify/N 
as  its  elements.  To  simplify  the  notation,  we  denote  this  vector 
by  Em-  Moreover,  the  matrix  Emm  is  completely  specified  when 
Em{=  Emi)  and  the  value  of  M  are  given. 

It  is  simple  to  see  that  the  matrix  EJjm^nm  is  symmetric,  at 
least  positive  semidefinite,  and  Toeplitz.  The  elements  of  this  matrix 
are  estimates  of  the  autocorrelation  function  of  the  sequence  a.  More 
explicitly,  define  the  sequence  a,  (i  =  0,  •  •  • ,  Af — 1)  in  terms  of  e,  as 

1 

~  iv 

j=i 

Then  a  simple  computation  shows 


ao 

ai 

flAf-l 

Emm  Emm  = 

ai 

ao 

aM^2 

(3) 

aM--i 

CiM-2  *  •  * 

Oo  , 

Using  these  definitions,  we  define  the  new  norm  as  the  maximum 
eigenvalue  of  Emm  Emm 

Vm{En)  =  KEnmEsm)  =  ^(Enm)  (4) 

where  A(F)  denotes  the  maximum  eigenvalues  of  F,  and  a(F) 
denotes  the  maximum  singular  value  of  F.  For  simplicity,  wc  usually 
delete  the  argument  of  Vm  and  assume  it  is  understood  to  be  a 
function  of  Em  which  is  itself  formed  from  the  prediction  errors  a. 
Note  that  Vm  defined  in  (4)  is  not  mathematically  a  norm  on  7^^; 
however.  \/Vm(Em)  is  a  valid  norm  for  Em*  and  only  to  simplify 
the  presentation  we  refer  to  Vm  as  a  norm. 

Also,  for  Af  =  1,  Vm  is  identified  with  the  usual  quadratic  norm. 
From  another  point  of  view.  Vi  only  includes  an  estimate  of  the 
autocorrelation  function  of  the  prediction  error  for  zero  shift,  ao- 
Moreover,  Vm  is  nicely  bounded  by  Vi  as  follows: 


If,  in  addition,  a,  is  in  /i,  then  the  spectrum  of  the  prediction  error  is 

oo 

5„(u>)  =  Y,  (8) 

!;=<->  OO 


where  we  set  a_i;  =  ajt  because  we  are  dealing  with  a  real  sequence. 
It  is  shown  in  [2]  that  the  following  are  true: 

ao  =  Ly‘'s„(u«)<ia»  (9) 

MCm)  <  sup|<^|<,5„(a>)  =  lim  A(Cm)  (10) 
A(Cm)  >  inf|„i<,  S„(w)  =  lim  X{Cm)  (11) 

At  — ^oo 


where  A(^)  A(F)  denote,  respectively,  the  smallest  and  the 
largest  eigenvalue  of  F,  and  the  Toeplitz  matrix  Cm  is  defined  as 
follows: 


Oo 

ao 

aAZ-i 

Cm  — 

ai 

ao 

a  A/ -2 

aM~2  •  ’  * 

ao 

Theorem  1:  The  following  limits  hold: 

lim  Em  Em  =  ^  [  S€«(tj)du; 

N— *oo  27r 

Jiit^  =  inf|„|<,  Se<(u)) 


(12) 


(13) 

(14) 

(15) 


where  we  assume  that  N  goes  to  infinity  faster  than  M. 

Proof:  Relation  (13)  follows  from  the  definition  of  ao  and  (9). 
Moreover,  by  definition  of  Cm*  we  have 

lim  eJjmEmm  —  Cm-  (lb) 

M  —*00 


IlfiNlli  =  V,{Es)  <  Vm{Es)  <  MVi(En)  =  M\\En\\1.  (5) 

To  illustrate  some  of  the  properties  of  Vm  for  Af  >  1.  assume 
Af  =  2.  The  maximum  eigenvalue  of  Em^Em2  is  simple  to  compute 
and  is  given  by 

V2  —  ao  +  1^1 1- 

In  this  case,  not  only  the  sum  of  square  of  prediction  errors  is  included 
in  the  performance  measure,  but  this  norm  also  includes  an  estimate 
of  the  autocorrelation  function  of  the  prediction  error  at  the  first  time 
shift.  Therefore,  minimizing  V2  will  force  |ai|  to  small  values.  This 


Substituting  this  in  (10)  and  (11)  and  noting  that  the  eigenvalues  of 
a  matrix  are  continuous  functions  of  the  elements  of  the  matrix,  the 
other  results  follow  immediately.  □ 

In  identification  problems,  we  estimate  the  model  parameters  6  by 
minimizing  Vm{Em{^))-  The  notation  Em{^)  emphasizes  the  fact 
that  the  prediction  error  is  a  function  of  0  and  the  minimization  is 
carried  over  elements  of  9.  Relation  (14)  is  very  illuminating  in  this 
respect  and  shows  that  by  minimizing  Vm  as  A/  approaches  infinity, 
the  supremum  over  all  frequencies  of  the  spectrum  of  prediction  error 
is  minimized.  Because  of  this  property,  wc  refer  to  the  identification 
problem  using  the  new  norm  (with  large  values  of  M)  as  the  Coo 
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idem ificat ion  problem.  In  contrast,  by  inininiizing  the  usual  quadratic 
norm,  the  integral  of  the  spectrum  of  prediction  error  over  all 
frequencies  is  minimized  [I],  and  this  can  be  referred  to  as  £2 
identification  problem  [sec  (13)], 

Now  we  explore  the  usefulness  of  the  new  norm  in  identification 
problems,  and  relate  the  Coo  norm  of  the  spectrum  of  the  prediction 
error  to  £00  norm  of  the  transfer  function  estimation  error.  Following 
the  procedure  used  in  [1],  let  us  assume  the  true  system  output  is 
generated  by 

yt  -  Go((j)u,  +  v,  (17) 

where  the  additive  noise  u/  has  the  spectrum 

5t,if(cu)  =  Aoj/foCc"^  )j^  (18) 

with  Ho(oo)  =  1,  Also  assume  the  suggested  model  for  the  system 
has  the  form 


yt  =  G(g,  e)ui  +  H(q,  e)e,  (19) 


where  S  is  the  vector  of  unknown  parameters.  Provided  u  and  v  are 
independent,  it  is  simple  to  show  that  the  spectrum  of  the  prediction 
error  in  this  case  is  given  by  [1] 


S  (u)  e)=z  ^)p5uu(c^)  +  5vu(u;) 


(20) 


where  G  =  G  —  Go  is  the  error  in  estimating  the  transfer  function. 

Unfortunately,  the  term  Svv/\H\^  in  (20),  which  depends  on  the 
parameter  6,  prevents  us  from  directly  resulting  the  minimization  of 
to  the  minimization  of  |(5|.  Using  a  procedure  similar  to  the  one 
outlined  in  [1],  we  can  first  use  a  high-order  ARX  model 


MQ)yt  =  B{q)ut  -h  et  (21) 

to  approximate  by  and  filter  both  ut  and  yt  by 

A{g).  Let  us  denote  the  filtered  input  and  output  by  u{  and  y/, 
respectively.  Next  use  the  following  output  error  model  to  estimate 
the  model  parameters  9 

y{  =  G{q,  e)u{  +  e,.  (22) 

Now  using  (20)  we  have 
S„(u^.e)  =  IGCe-’*',  e)|*|>l(e’‘")|*S„4a;) 

+l/l(e-'“)l*S....(a;).  (23) 

If  ll\A\  is  a  good  approximation  to  |ffo|,  then  the  last  term  in  (23) 
is  a  constant  equal  to  Ao,  and  we  can  write 

S„{u;,  0)  -  Ao-  (24) 

Using  (24),  it  is  clear  that  minimizing  the  supremum  of  5,,  in  this 
case  will  directly  lead  to  the  minimization  of  the  weighted  Coo  norm 
of  G.  Note  that,  as  expected,  the  weighting  \  AfSuu  («  Suu/\Ho\^) 
puts  more  emphasis  on  the  frequency  ranges  where  the  signal-to- 
noise  spectral  ratio  is  large.  Also,  by  repeating  the  experiment  with  a 
different  input  (changing  S„u)  ,  we  have  the  flexibility  of  changing 
this  weighting  factor. 

Note  that  if  the  model  used  for  G  in  (22)  is  finite-impulse-response 
(FIR),  then  the  new  norm  is  a  convex  function  of  the  parameters; 
otherwise,  the  problem  in  general  is  nonconvex  and  may  be  difficult 
to  solve. 

Moreover,  after  minimizing  Vm  (for  sufficiently  large  value  of 
M),  we  can  compute  a  good  estimate  for  the  supremum  over  all 
frequencies  on  the  left-hand  side  of  (24).  Since  the  supremum  over 
all  frequencies  of  the  first  term  on  the  right-hand  side  of  (24)  can 
be  approximated  by  the  minimum  value  of  Vm,  and  the  value  of  Ao 
(variance  of  the  noise)  can  be  approximated  when  we  are  computing 
the  ARX  structure  in  (21),  this  gives  an  estimate  for  the  weighted 
Coo  norm  of  modeling  error. 


TABLE  I 

Sami^lh  Mi^n,  Standard  Deviation,  and  95%  Confidence 
Tiirlsmold  oi-  C  for  Several  Different  Values  of  M  and  N 


M 

10 

20 

50 

1.59 

3.87 

16.77 

A'=I(K) 

0.76 

1.92 

12.42 

3.08 

7.50 

36.90 

0.94 

2.05 

6.88 

II 

0.36 

0.80 

3.46 

1.60 

3.60 

12.30 

0.52 

0.99 

2.64 

iV  =  000 

0.17 

0.31 

0.84 

0.84 

1.53 

4.26 

0.34 

0.63 

1.48 

N  =  1000 

0.11 

0.18 

0.38 

0.55 

0.95 

2.15 

IV.  Whiteness  Test 

Now  we  digress  a  little  and  show  how  the  relations  given  in  (14) 
and  (15)  can  be  used  to  derive  an  interesting  test  for  checking  the 
whiteness  of  a  sequence.  Note  that  a  white  sequence,  by  definition, 
has  a  flat  spectrum  (impulsive  autocorrelation  function).  Therefore,  a 
sequence  is  white  if  and  only  if  the  ratio  of  the  supremum  to  infimum 
of  the  spectrum  of  the  sequence  is  unity.  Using  this  observation,  the 
condition  number  of  E/vm  as  N  and  M  approach  infinity  (with  N 
faster  than  M)  should  converge  to  unity  for  a  white  sequence. 

With  this  introduction,  we  propose  the  following  condition  number 
test  for  checking  the  whiteness  of  a  given  sequence.  Choose  an 
appropriate  value  of  M  and  compute  the  ratio 


where  a  and  a  are  the  smallest  and  largest  singular  values  of  the 
Et\;M  matrix.  If  C  is  less  than  a  specified  threshold,  then  declare  the 
sequence  as  white;  otherwise,  declare  it  as  nonwhite.  Note  that  C  is  the 
condition  number  of  Enm  squared  minus  one,  and  hence  is  always 
positive;  and,  for  a  white  or  slightly  correlated  sequence,  it  should  be 
close  to  zero,  and  otherwise  should  be  considerably  larger  than  zero. 

To  use  this  test,  we  have  to  compute  the  values  with  which  the 
ratio  C  should  be  compared  for  any  given  values  of  N  and  M.  Note 
that  assuming  we  are  dealing  with  a  white  sequence,  it  is  almost 
impossible  to  analytically  compute  the  probability  density  function 
of  C,  and  from  that  the  necessary  95%  confidence  threshold.  By  95% 
confidence  threshold,  we  mean  the  value  of  a  for  which  the  following 
relation  holds: 


Prob«  <  q)  =  0.95  (26) 

assuming  the  sequence  under  consideration  is  actually  white. 

We  used  a  series  of  simulations  to  compute  these  confidence 
thresholds  and  the  first  and  second  moment  properties  of  assuming 
the  sequence  under  consideration  is  actually  white.  For  this  purpose, 
for  any  given  values  of  N  and  A/,  we  generated  1000  different 
random  white  sequences  (of  length  iV),  and  in  each  case  computed 
the  ratio  C  given  in  (25).  The  computed  results  for  different  values 
of  M  and  N  are  given  in  Table  I.  The  three  numbers  in  each  cell 
of  this  table  from  top  to  bottom  are,  respectively,  the  mean,  standard 
deviation,  and  95%  confidence  threshold  of  C- 

Now  to  recapitulate — we  use  Table  I  for  checking  the  whiteness  of 
a  sequence  as  follows.  Choose  the  value  of  N  closest  to  the  length  of 
the  sequence.  Tlien  choose  an  appropriate  value  of  M,  Usually,  M 
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TABLE  II 

Nummoer  of  Times  the  Sequences  Did  Not  Pass  the 
Whiteness  Test  AS  THE  Pole  Location  r  is  Varied 


r 

0 

0.05 

0.1 

0.15 

0.2 

Condition  No. 

51 

115 

415 

707 

940 

Chi-Squarc 

54 

83 

267 

577 

884 

should  be  chosen  between  A71O  to  A750.  Next  choose  a  segment 
of  the  sequence  with  length  N  and  form  the  matrix  Enm.  and  from 
that  compute  the  C  given  in  (25).  Next,  refer  to  Table  I  and  check 
whether  the  computed  value  of  <  is  less  than  the  required  threshold. 
If  this  is  the  case,  the  sequence  is  most  probably  white.  Otherwise, 
it  is  most  probably  colored. 

We  compared  the  performance  of  the  proposed  condition  number 
whiteness  test  to  the  chi-square  test  [IJ.  [3).  We  remind  the  reader 
that  in  the  chi-square  test,  the  following  sum  is  computed 


V  = 


(27) 


where  the  ai  (t  =  0,  •  •  • ,  M  “  1)  are  defined  in  (2),  and  the 
result  is  compared  to  the  95%  confidence  threshold  of  the  chi-square 
distribution.  The  reason  for  choosing  M  —  1  as  the  upper  limit  of  the 
surrunation  in  (27)  is  because  in  the  condition  number  test  only  the 
autocorrelation  terms  up  to  M  —  1  are  present. 

For  this  comparison,  we  generated  (colored)  sequences  by  passing 
white  sequences  (which  were  uniformly  distributed  at  each  instant) 
through  the  first-order  filter 

H(z)  =  — .  (28) 

z  —  r 


For  each  pole  location  (value  of  r),  we  filtered  1000  different  white 
sequences  (each  being  500  points  in  length)  by  if  (2).  and  after 
subtracting  the  sample  mean  of  the  sequence  computed  the  ratio  C 
and  r/.  in  each  case  assuming  M  =  10.  The  number  of  times  the 
computed  C  was  larger  than  0.84  (the  95%  confidence  threshold  of 
condition  number  whiteness  test  for  N  =  500  and  M  =  10)  is  given 
in  Table  11.  The  number  of  times  the  computed  rj  was  laiger  than  16.9 
(which  is  the  95%  confidence  threshold  of  the  chi-square  distribution) 
are  also  shown  in  Table  IL  As  can  be  seen,  the  condition  number  test 
repeatedly  performed  better  in  this  particular  problem  [it  has  a  sharper 
operating  characteristic  (OC)  curve  (cf.  [4]).  We  emphasize  that  the 
condition  number  whiteness  test  is  just  another  lest  that  may  prove 
useful  in  a  particular  application. 

Moreover,  note  that  for  r  =  0,  the  output  of  the  first-order  filter 
will  be  a  white  sequence,  and  we  expect  that  about  5%  of  the  1000 
runs  (50  runs)  will  fail  the  whiteness  test,  which  is  very  close  to  the 
values  that  are  actually  obtained  in  these  simulations  for  both  tests. 

Finally,  we  indicate  that  there  are  several  other  whiteness  tests 
that  can  be  devised  using  the  singular  values  of  the  Ej^jm  matrix. 
Particularly  interesting  tests  are  based  on  comparing  the  following 
ratios  with  appropriate  thresholds 

0  =  (29) 

C2  =  (30) 


with 


a 


2  _ 

av 


M 

t=i 


(31) 


where  <Ti  arc  the  singular  values  of  Eat  a/.  Using  the  dcfinilion  of  no 
and  the  fact  that  the  trace  of  the  matrix  in  (3)  is  simply 


Mao,  it  is  clear  that  we  have  no  =  Moreover,  it  is  clear  from 
(31)  that  the  following  inequalities  hold 

2:^  <  =  "0  <  (32) 

Note  that  this  relation  holds  for  any  finite  value  of  M  (with  M  <  N), 
and  is  the  analog  of  the  following  series  of  inequalities 

inf|^|<^  St.e(t^)  ^  2^  j  S,-f  (u>)  (iw  <  SUP|^j^^  Sre-(txj).  (33) 

For  a  white  sequence,  the  left-hand  and  right-hand  side  of  the 
inequalities  in  (32)  should  converge  to  the  same  value,  and  the  tests 
given  in  (29)  and  (30)  are  based  on  this  observation.  Moreover,  note 
that  because  of  (32).  the  ratios  <i  and  C-i  arc  always  positive,  and 
for  a  white  sequence  both  converge  to  zero  as  M  and  N  approach 
infinity  with  N  faster  than  M. 

V.  Convergence  and  Convexity 

The  norm  introduced  in  (4)  has  some  interesting  properties  that 
we  shall  discuss  next.  Let  us  fix  M,  and  assume  we  are  given  a 
model  structure  and  identify  the  parameter  vector  $  of  this  model 
by  minimizing  Vp{E^'(S))  where  P  is  a  positive  integer  less  than 
M  (F  <  M).  Let  us  assume  this  optimization  problem  has  a  unique 
global  minimum  that  we  will  denote  by  6^,  and  denote  the  prediction 
error  sequence  resulting  from  this  choice  of  the  parameter  vector  by 
=  EN(e’’).  Similarly,  define  9'^  and  =  Ew(e'^). 

Theorem  2:  The  following  series  of  inequalities  hold: 

Vp(E^)  <  Vp{E}f)  <  Vm{E^)  <  Vm(E^).  (34) 

Proof:  Beginning  from  the  left-hand  side,  the  first  inequality 
follows  from  the  fact  tat  the  elements  of  are  generated  from  the 
model  parameters  that  minimize  Vp.  The  second  inequality  follows 
from  the  interlacing  property  of  the  eigenvalues  of  a  symmetric  matrix 
[5].  Note  that  is  the  first  P  x  P  principle  minor  of 

where  E^p  and  E'^m  are  defined  in  terms  of  E}f 
using  (1).  The  third  inequality  follows  from  the  fact  that  Fjv  is 
formed  from  model  parameters  that  minimize  V'vf.  □ 

The  relation  given  in  (34)  is  especially  useful  if  we  set  P  =  1.  Then 
Vi(Ejv)  is  the  minimum  value  of  the  usual  least  squares  performance 
measure.  Also,  in  this  case,  we  can  add  another  important  inequality 
to  the  set  given  in  (34). 

Corollary  J:  The  following  series  of  inequalities  hold: 

V,(EA,)  <  Vi(Em  <  V^iE^f)  <  V.M{Eh) 

<  Vi(El,)  +  (M  -  1)  max (|nl|.  •  ■  • , |a"M-,|)  (35) 

where  o*  are  computed  from  the  elements  of  Ely  using  the  relation 
given  in  (2). 

Proof:  The  first  three  inequalities  follow  by  setting  P  =  1  in 
(34).  Moreover,  because  Q  =  (Pa'a/)^Pv.\/  is  Toeplitz  with  al  on 
its  main  diagonal,  each  eigenvalue  of  Q  denoted  by  A  satisfies  the 
following  inequality: 

lA-ail  <  (M-  l)max(|n;|,---.l(,;^,_,|).  (36) 

This  follows  from  Gershgorin’s  circle  theorem  [5],  and  hence  the 
last  inequality  in  (35)  holds.  Note  that  the  aj  in  (35)  are  estimates  of 
the  autocorrelation  functions  of  the  prediction  error  computed  from 
parameters  that  are  obtained  by  minimizing  the  quadratic  norm.  □ 
Now  let  us  assume  that  for  a  particular  problem  Vi(Ersf(0))  and 
Fay{P/v(^)),  both  have  unique  global  minimum  that  are  denoted 
by  0^  and  0^,  respectively.  Moreover,  let  us  assume  that  in  this 
problem,  the  last  term  in  (35)  goes  to  zero  as  the  number  of  data 
points  increases.  In  other  words,  assume  for  a  fixed  A/,  we  have 

lim  max  I)  =  0. 

/v  — -oo 


(37) 
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Then,  using  (35),  it  is  clear  iliai 

lun  Vi(Ea;(0'))=  Vi(Er^(G^)).  (38) 

N— oo  A* — oo 


Now  using  the  assumption  on  the  uniqueness  of  the  global  minimum 
of  V'l,  it  is  clear  that  in  the  limit  and  6^^  will  be  identical. 
Stated  more  loosely,  if  the  prediction  error  for  the  quadratic  norm 
minimization  is  white,  then  the  parameters  obtained  by  minimizing 
the  new  norm  will  be  identical  to  those  obtained  by  minimizing  the 
usual  quadratic  norm. 

To  guarantee  that  each  V'j  and  Vm  have  global  minima  only,  let  us 
choose  an  ARX  model  for  the  structure  of  the  system.  In  this  case, 
it  is  well  known  that  the  scaled  prediction  error  can  be  written  as 

Er,  =  -^(Y-^e)  (39) 

where  is  the  matrix  of  regression  vectors  and  Y  is  the  vector  of 
output  values  [1].  In  this  case,  Vi{En(0))  in  a  convex  function  of 
0,  and  the  minimization  problem  has  only  global  minima.  Moreover, 
if  ^  is  full  column  rank,  then  the  minimum  is  unique. 

Next  we  show  that /or  an  ARX  model  structure,  Vm  is  also  a  convex 
function  of  the  parameters.  To  see  this,  note  that  the  matrix  E/vay 
can  be  written  as 


A/  _  M 

enm  =  yyiE^-wj  =  -=yTi(Y  -  ^e)wT 

i=i 


(40) 


where  Wi  £  is  the  standard  basis  column  vector  with  I  in  its 
rth  entry  and  all  other  elements  zero.  Also,  the  {A^  4*  M  —  1)  x  A'’ 
matrix  r,  is  defined  as 


r.  = 


In  X  N 
P(AY-i)x  A 


(41) 


Moreover,  denote  the  fth  column  of  ^  by  <i>i  and  the  tth  element  of 
6  G  by  6i.  Then  (40)  can  be  rewritten  as 


Enm  =  Co  + 

i=i 


(42) 


Note  that  C>,  j  =  0,  •  •  • ,  L,  have  the  same  special  structure  as  Eis;m^ 
namely,  being  constant  along  the  diagonals.  Now,  using  (42),  it  is 
clear  that  Ej^m  is  affine  in  0^  and  a{EsM)  is  a  convex  function 
of  6.  Therefore,  Vm,  being  the  square  of  ^(Enay ),  is  also  a  convex 
function  of  0,  and  the  minimization  problem  has  only  global  minima 
in  this  case. 

Using  these  facts,  if  we  use  an  ARX  model  structure,  and  if  it  hap¬ 
pens  that  the  resulting  prediction  errors  are  white  (and  consequently 
the  relation  (37)  holds),  then  we  are  guaranteed  that  the  parameter 
estimate  using  the  new  norm  will  be  same  as  the  parameters  using 
the  quadratic  norm  for  N  oo.  This  is  promising  because,  for 
the  sum  square  norm  and  ARX  structure,  there  are  many  established 
properties  [1]  that  readily  extend  to  the  new  norm. 

If  the  prediction  error  sequence  is  not  white,  however,  which  will 
be  the  case  if  the  “true**  model  does  not  have  an  ARX  structure, 
then  the  estimate  given  by  minimizing  Vm  will  usually  be  different 
from  those  obtained  from  the  quadratic  norm  minimization.  Note  that 
the  new  norm  forces  the  autocorrelation  of  the  prediction  error  for 
nonzero  shifts  to  small  values  (whitens  the  prediction  error),  and  this 
property  may  result  in  a  better  estimate  of  the  model  parameters 


(compared  to  quadratic  norm  for  a  given  model  order)  when  the  true 
model  is  not  actually  inside  the  model  set. 

As  we  have  shown  previously,  for  an  ARX  model  structure,  the 
matrix  EfsjM  is  affine  in  the  parameters,  and  we  are  interested  in 
minimizing  the  maximum  singular  value  of  E^^M^  This  problem 
is  already  discussed  in  the  literature  (6|.  and  a  recent  algorithm  is 
proposed  in  [71.  By  exploiting  the  special  structure  of  the  matrices 
C,  defined  in  (42),  however,  it  may  be  possible  to  increase  the 
efficiency  of  the  algorithm  in  [7],  Also,  in  our  application,  the  size 
of  the  matrices  involved  is  quite  large,  and  special  attention  should 
be  paid  to  the  memory  management  and  algorithmic  implementation: 
otherwise,  huge  amounts  of  memory  will  be  required  to  perform  the 
optimization  even  for  modest  values  of  M  and  N. 


VI.  Conclusion 

Although  we  have  presented  some  preliminary  results  on  the 
properties  of  the  Coo  identification  problem  in  this  correspondence, 
much  further  work  is  required  to  explore  the  properties  of  the  new 
norm  in  details.  To  perform  this  task,  an  efficient  implementation 
of  the  required  minimization  algorithm  is  required  so  realistic  high- 
order  models  can  be  estimated  and  their  properties  can  be  compared 
to  those  of  the  least  square  minimization.  As  we  previously  noted, 
the  convexity  of  the  new  norm  when  an  ARX  model  is  used 
is  an  important  property,  and  hence  many  techniques  of  convex 
optimization  can  be  used  for  the  solution  of  this  problem. 
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A.l>stract  Two  least-squares  based  methods  are  presented  for 
obtaining  ARX  model  sets.  The  first  is  obtained  using  proper¬ 
ties  of  high-order  ARX  models  and  the  second  uses  a  stochastic 
embedding  scheme  on  the  residuals  from  an  ARX  model  of  any 
order.  Esther  of  the  ARX  model  sets  is  useful  for  robust  control 
of  systems  with  uncertain  parameters.  Using  the  high  order  ARX 
model  approach.,  the  parameter  uncertainty  lies  in  a  confidence  el¬ 
lipsoid.  Using  the  stochastic  embedding  approach,  the  parameter 
uncertainty  is  ia  confidence  box.  For  scalar  plants,  both  cases  can 
be  handled  using  convex  programming  to  obtain  the  exact  stabil¬ 
ity  robustness  margin  for  a  particular  controller.  However,  because 
the  uncertainty  description  is  probabilistic,  the  robustness  property 
has  to  be  associated  with  a  confidence  level,  i.e.,  a  probability  of 
stability. 


1  Introduction 

Suppose  it  is  desired  to  control  the  sin^e-input-single-output  sta¬ 
ble  discrete-time  system, 

y  =  Gu  +  i/c,  c  €  (l) 

where: 

•  G  and  H  are  known  only  to  be  causal  linear-time-invariant 
(LTI)  stable  systems  with  imknown  transfer  functions  G{z) 
and  H{z)f  respectively. 

•  The  sequences  y  and  u  are,  respectively,  the  sensed  output, 
and  the  applied  control  input.  All  that  is  available  is  the 
finite  data  record 

{vt.ttt  1 1  =  (2) 

where  yt  and  ut  are  the  values  of  the  sequences  y  and  u,  re¬ 
spectively,  at  the  sample  time  t. 

•  The  sequence  e  is  unpredictable,  but  is  known  to  be  a  member 
of  the  set  E(A)  where  A  is  unknown:  likely  candidlates  for 
E(A)  are  PO^^(A),  the  set  of  sequences  with  power  bound 
A,  or  IJD(A),  ii.d.  zero-mean  sequences  with  variance  A. 
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Computer  Sciences,  Contract  No.  F49620-90-C-0064,  NSF 
U.S./ Australia  Cooperative  Research  Program,  Travel  Grant  INT- 
9014152,  and  the  Australian  National  University  as  a  Visiting 
Fellow. 

1  Supported  by  the  Cooperative  Rersearch  Center  Program  of  the 
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The  problem  addressed  here,  referred  to  as  set  estimatiorij  is  to  use 
the  above  information  to  determine  a  set  description  of  the  plant 
useful  for  robust  control  design.  For  example,  consider  the  plant 
set' 

M  ;  {v  =  (G  +  AVy)u  +  He  \  ||A|1„^  <  9.  e  €  E(A)  } 

If  E(A)  =  P O W (A),  then  M  is  typical  starting  point  for  oo  con¬ 
trol  design.  If  E(A)  ==  HID  (A),  then  mixed  K^/Hoo  control  design 
methods  apply.  There  are  many  combinations  possible.  However, 
in  all  the  above  cases,  the  quantities  with  “hats"  are  available  a  pri¬ 
ori  to  the  designer,  which  is  not  the  case  here.  These  quantities  are 
to  be  estimated  from  the  data  and  a  priori  information. 

In  the  remainder  of  the  paper,  we  show  how  least-squares  esti¬ 
mation  can  be  used  to  obtain  a  plant  set  estimate.  We  exeunine  two 
cases:  (i)  high  order  model  set  estimation,  and  (ii)  model  set  esti¬ 
mation  via  stochastic  embedding.  Related  techniques  can  be  found 
in  [5]  and  the  references  therein. 


2  Least-Squares  Estimation 

Parameter  estimation  via  least-squares  with  an  ARX^  model 
structiire  is  perhaps  the  most  widely  used  approach  to  system  iden¬ 
tification.  The  standard  (c.y.,  [7])  parametric  ARX  model  set  is: 

Marx  :  D«y  =  Seu  +  e  I  e€R^  e€lID(A)}  (3) 

where 

n  m 

Ae  =  1 -|- 

t=i  i=i 

e  =  [ai  6i  p  =  n -h m 

Thus,  the  ARX  model  output  at  tome  t  is  ^ven  by, 

yt  =  4*T  ^  +  ct 

4T  =  [“Vt-l  •  •  •  -  yt-n  Ut^i  •  •  •  «t-m] 

'If  A  is  stable,  11A1|„^  =  sup^  lA(c^“')l,  otherwize,  I1A||^^  = 
oo. 

^  ARX  is  the  standard  acronymn  for  system  models  with  an  Au- 
toRegressive  term  (Ay)  and  an  cXtemal  input  (Bu). 


The  least-squares  parameter  estimate,  based  on  a  finite  data  record 
is  found  from: 

N 


?=  argnun  i 


(4) 


t=l 


Limiting  Properties  It  is  weU  known  [7]  that 
6  —*>  6 f  as  TV  — ►  oo,  w.p.  1 

where 


“  1 
6  =  argmin  —  /  5crr(u;,  e)du/ 
0  27r  / 

•/  — n 


with  the  “error”  spectrum  given  by, 

S„r{u>,e)  =  l>le(c^“)G(e^“)-Be(c^“)pS„„(u;) 

+  A|Ae(eJ")B(e^“)|2 

Much  more  can  be  said  when  the  system  (1)  is  in  the  ARX  model 
set  (3),  i.e.,  when  there  exists  a  parameter  Bo  e  R**  such  that, 
—  Bgj  =  ^and  Ae^H  =  1.  In  this  case,  6  =  and  the 

parameter  error  Bo  is  asymptotically  normally  distributed  i.e., 
as  N  — ►  oo, 

\/N{6  -  ^o)  (5) 

where  denotes  expectation.  In  addition,  consistent  estimates 
of  and  A,  are  given  by 


with 


t=:l  ‘  t=l 

'ct  —  yt-4>j6 

the  estimated  prediction  error. 


Large  N  Properties  To  simplify  notation  set 

a  =  a^  b  =  b^ 

Define  S  G  RP  as  the  normalized  (unknown)  parameter  error, 

S  =  ^/^(6-Bo)  (6) 

Then,  the  true  system  (1)  can  be  expressed  as: 

Ay  =  Bu  —  -f  c  (7) 

It  follows  that  for  sufficiently  large  TV  [7]: 

SeA/'(0,^)  and  ^ g  ^(p,  _  p)  (8) 

where  ^(p,  .AT  —  p)  is  the  F-distribution  with  degrees  of  freedom  p 
and  TV  —  p.  Hence, 

Prob{«^«  <  ^a(j?)A}  =  q  (9) 

can  be  determined  from  an  F-distribution  table.  To  be  safe,  suppose 
we  set  T)  very  high,  say,  q  =  .999.  Then  for  typical  numbers  such  as 
N  >  1000  and  n  =  m  =  10,  we  get  a(.999)  =  2.27.  For  large  n,  say 
n  =  m  =  60,  and  large  N  >  n,^ve  get  ot(.999)  w  1.45,  and  so  on. 
In  addition,  for  large  AT,  e  6  E(A)  with  high  probabiUty.  Hence,  for 
large  n,  m,  and  N,  the  system  (1)  is  in  the  model  set 

Ay  =  Bu  —  +  e 


M 


ARX 


3  Robust  Control  with  ARX  Set 

In  this  section  we  discuss  the  issue  of  robust  control  design  retain¬ 
ing  still  the  assumption  that  the  true  system  is  in  the  ARX  model 
set  Marx  of  (10).  Suppose  we  apply  the  LTI  feedback  controller 

u=z^Ky  (II) 

where  K  stabilizes  the  “nominal”  ARX  system  (5  =  0), 

Ay  =  Bu  +  e 

Applying  the  control  to  the  actual  system  model  (7),  gives  the 
closed-loop  system 


f  H  =  f  T, 

le-  ^ 

f  ' 

1  “  J  [-Qs 

1  i-s-^h 

.  . 

where 


T  = 


A-bBRT 


K 


A  +  BK 


=  B-’/* 

DT  ' 

,  D  = 

■  z-l  - 

.  . 

Because  K  stabilizes  the  nominal  system,  T,  Q  and'^  are  all  stable. 

Recall  from  the  Nyquist  theorem  that  since  h  is  stable,  the  closed- 
loop  system  is  stable  if  and  only  if,  for  all  a;, 

|1-«’^(«^")I#0,  <  r*  <  r?, ,, 

where  the  “real”  stability  margin  is  given  by, 

r.tab  =  inf  inf  {s'^S  =  r(a;)^  i  | 

w  r(w)  ^  I  '  '  J 

Calculating  r(a;)  involves  finding  the  minimum  norm  (least-sqtiares) 
solution  to  the  over-determined  set  of  equations  =  1  at 

each  frequency.  As  shown  in  [10],  an  easy  calculation  yields. 

Vm\%  6^0 
6  =  0 

where 

a  =  Reh(e^“'),  6=ImTi(e^“') 

Hence,  a  “probabOity  of  stability”  can  be  stated  as  foUows.  Since 
Prob{fi^5  <  ^a(q)A}  =  q 


1  1/lHP, 


(12) 


*(»))A  <  r*  ^  Prob{(l  -  S'^h)-^  stable}  >  q 


it  follows  that 
P 

n' 

In  the  more  general  case,  calculating  the  stability  margin,  for 

other  than  the  two-norm  of  the  parameter  error,  can  be  cast  as  a  a 
convex  programming  problem,  e.g,,  [10].  However,  no  closed  form 
or  convex  programming  solution  is  known  in  the  MIMO  case. 


(10)  4  High-Order  ARX  Model  Set 


c€lID(A) 

with  probability  of  at  least  tj.  Observe  that  the  mod<d  uncertainty  in 
this  set  is  represented  as  a  norm  hound  on  the  noi^alized  parameter 
error  with  a  probability  tag  which  can  be  selected  close  to  one. 
Putting  aside  for  the  moment  the  issue  of  whether  or  not  the  true 
system  is  in  the  model  set,  it  is  therefore  logical  to  pursue  robust 
control  with  this  type  of  parametric  uncertainty. 


The  analysis  in  the  previous  sections  depended  on  the  true  sj'stem 
being  a  member  of  the  parametric  ARX  model  set.  Although  this 
is  never  true,  it  is  a  fact  (e.^.,  [7])  that  the  system  (l)  is  equi^^ent 
to  the  infinite  order  ARX  model: 


Ay  =  Bu  +  < 


(13) 


where 


A  =  =l  +  ^a**-»=  (14) 

fc=sl 

oo 

B  =  (15) 

kz=l 

Because  and  H~^G  oxe  stable,  there  exists  positive  constants 
A^a,A/b,  and  p  <  1  such  that 


described.  First,  if  K2  or  Tioo  methods  are  useed  for  design,  the 
controller  (11)  will  also  be  of  high  order  -  although  in  most  cases  it 
is  easy  to  reduce  the  controller  because  of  the  msuiy  stable  near  pole 
zero  cancelations  in  the  estimated  model.  Secondly,  a  determination 
of  what  is  meant  precisely  by  high  order  is  dependent  on  a  priori 
knowledge  about  the  true  system.  Thirdly,  the  statistical  properties 
are  based  on  very  Wge  data  lengths,  2md  a  precise  value  of  “Iju-ge” 
depends  on  the  true  system  properties. 


\ak  \  <  |6fc|  <  Mfcp* 

It  therefore  follows  that  the  n-th  order  ARX  model, 


Any  =  Bntt  e 

(16) 

n 

An  =  14*^^ 

(17) 

k^l 

B„  = 

(18) 

fc=i 


can,  for  sufficiently  large  n, arbitrarily  well  approximate  the  infinite 
order  ARX  model,  and  hence,  the  original  system.  Specifically,  it 
is  easily  shown  that  IjAnG  and  \\AnH  —  approach 

zero  exponentially  as  n  goes  to  infinity.  As  a  result,  it  is  very 
tempting  to  use  the  previoiis  result  with  very  high  order  models, 
partic^arly  since  it  is  very  easy  to  compute  such  models.  The 
following  example  illustrates  this  approach. 

Example  The  following  example  is  presented  without  figures 
—  no  room!  Take  the  true  as  the  zero*order-hold  equivalent 
of  -f-  2^ns  +  with  fl  =  27r(.l)  rad/sec,  C  =  *15,  and  a 

sampling  frequency  of  2  hz.  Take  the  noise  model  as  H  -  1  with 
c  €  IID(.22)  and  with  input  u  e  nD(l).  Hence,  the  rms  noise 
to  signal  ratio  at  the  output  is  rms(e)/nns(au)  =  .28.  The  sys¬ 
tem  was  simulated  with  u  and  e  drawn  from  the  previous  distribu¬ 
tions.  Least-squares  identification  was  performed  for  time  samples 
t  =  1  :  512  for  model  orders  n  =  m  =  1  :  40.  All  the  models 
were  v^dated  on  a  different  data  set  for  t  =  513  :  1024.  FVom  the 
validation  set,  the  minimum  rms  value  occurs  at  n  =  14.  Because 
larger  orders  show  an  increase  in  rms,  any  models  of  order  higher 
than  14  must  be  trying  to  fit  the  particular  noise  realization  dur¬ 
ing  identification.  The  fits  in  magnitude  are  very  similar,  but  get 
more  noisy  at  high  frequencies.  Since  the  true  noise  has  a  constant 
spectrum,  we  see  that  the  higher  orders  are  being  used  primarily 
to  flatten  the  estimated  spectrum.  One  would  therefore  expect  a 
significant  number  of  pole-zero  cancelations  in  the  estimated  trans¬ 
fer  function.  This  phenomena  is  dramtically  visible  in  a  pole  zero 
plot  of  the  estimated  transfer  function.  Moreover,  it  is  interesting 
to  examine  the  Hankel  singular  values  as  well. 


To  offset  the  high  order  problem,  an  alternative  is  to  use  a  more 
parsimonious  model  pairametrization.  For  ex^mlple,  use  of  Laguerre 
or  Kautz  expansions,  as  proposed  in  [ll],  can  result  in  considerably 
fewer  parameters  to  obtaiin  the  same  level  of  approximation  as  a 
model  expanded  in  the  backward  shift  operator  .  However,  the 
efficacy  of  this  appro2kch  depends  on  prior  information  reg8u*ding  the 
accuracy  of  some  dominant  pole  locations. 

5  Residual  Analysis  and 
Stochastic  Embedding 

The  stochastic  embedding  principle  developed  by  Goodwin  et 
a/. [3]  provides  a  completely  stochastic  framework  for  model  error 
estimation.  No  assumptions  on  model  order  or  data  length  are  re¬ 
quired,  The  basic  idea  is  to  view  model  error  as  a  reeJization  of 
a  random  variable  with  zero  mean,  whose  variance  is  described  by 
a  few  parameters  which  captures  the  structure  of  the  model  error. 
In  other  words,  the  complicated  problem  of  relating  the  bias  to 
the  data  and  the  mismatch  in  structure  of  the  true  system  and  the 
model  is  avoided.  By  modeling  the  error  in  this  seemingly  rudimen¬ 
tary  form,  an  error  model  set  parametrization  is  obtained  which  is 
described  by  a  small  number  of  parameters,  yet  is  capable  of  rep¬ 
resenting  a  large  set  of  error  models.  In  [3]  a  Maximum  Likelihood 
(ML)  approach  is  used  to  define  the  estimates  of  these  model  error 
parameters.  In  this  section,  we  use  the  stochastic  embedding  prin¬ 
ciple,  but  applied  in  a  slightly  different  way,  avoiding  some  of  the 
approximations  involved  in  solving  for  the  ML  estimate.  Although 
no  prior  assximptions  about  high  model  order  are  required,  the  re¬ 
sults  presented  here  —  unlike  those  in  [3]  —  are  asymptotic  in  data 
length. 

To  apply  the  stochastic  embedding  principle  of  [3],  we  proceed  as 
follows:  suppose  we  have  obtained  ARX  estimates  A,  B  from  some 
finite  data  record.  Since  the  true  system  (1)  is  y  =  Gu  +  i/e,  it 
follows  that  the  prediction  error  is: 

e  =  Ay  —  Bu  =  Ati  +  w  (19) 

with 


An  H2  controller  was  designed  \ising  the  14th  order  model.  (We 
know  this  controller  stabilizes  the  true  system,  because  in  this  case 
the  true  system  is  being  simulated.  Normally,  of  course,  the  true 
system  is  unknown  to  the  designer.)  Although  the  closed-loop  sys¬ 
tem  is  stable  and  seems  to  behave  well,  the  robust  analysis  suggests 
testing  the  worst-case  plant  possible  from  the  ARX  model  set.  By 
examing  the  par am ae trie  stability  margins  vs.  frequency  (12)  for 
probability  .95  and  .995,  respectively,  it  can  be  seen  that  at  about 
.5  hz,  the  closed-loop  system  is  just  stable  for  a  .95  probability  and 
just  unstable  for  .995.  The  interpretion  is  that  there  exists  a  plant 
“  ARX  set  which  could  have  produced  the  data  with  the  same 
statistics.  Since  this  pleuit  would  cause  considerable  trouble  for  this 
controller,  the  controller  should  be  redesigned.  ,  e.y.,  the  loop  gain 
should  be  reduced  in  the  .5  hz  range. 

Caveat  Emptor  There  are  several  impediments  to  using  the 
high-order  ARX  estimation  and  robust  control  design  approach  just 


A  =  AG  -  B  (20) 

u;  =  AHe  (21) 

As  before,  our  goal  is  to  make  some  kind  of  estimate  covering  A. 
Towar^  this  end,  form  the  (asymptotic)  prediction  error  correlation 
coefficients: 

c*  =  £(eie‘,+fc),  Vk>0  (22) 

If  t»  €  IID(Au),  then: 


OO 

Aty  -f*  An  ^  ^  6^  , 
issl 
00 

Au  ^ 

•=1 


k  =  0 

k  >  1 


(23) 


where 

Au»  =  €(wt)^ 


(24) 


5'? 


imd  are  the  impulse  response  coefficients  of  A,  i.c., 

CO 

=  (25) 

1=1 

fCr  ^  **  positive  constants  Af,  and  p  <  1  such 

Hencci 


Clearly  a  solution  exists  iff  there  is  a  real  root  0  <  j?  <  i,  gy 
replacing  with  we  get  the  polynomial  in  P/(3: 

L 

L  +  : 


M  <  { 


|5i|  <  Mgp'-^ 

(26) 

,  Af? 

Aui  +  Au  —  0 

1 

(27) 

Au- - ^>1 

1  -  p2 

f  11  •  r  7  T  -  *  ^rrciauon  coefficients  decay  exponen- 

S/'?  \  “  contained  in  the  correlation 

«>efficients  about  the  model  error  A  in  the  form  of  bounds  on  the 
impulse  reponse  coefficients  The  problem  with  (27)  is 

that  only  bounds  are  provided  and  thus  it  is  difficult  to  infer  esti- 

f  f  of  o*.  The  key  is  to  utilize 

tne  stochastic  embedding  put  forth  in  [3], 

Stochastic  Embedding  Make  the  further  assumption  that 
the  model  error  is  a  realization  of  a  stochastic  process  such  that: 


CWly  13  =  p  is  a  solution,  and  certainly  the  desired  one  in  that 
or  -  Of.  In  fac^  It  can  be  shown  that,  on  average,  the  only  real 
solution  is  p  =  p.  Even  though  L  can  be  quite  large,  c.g.,  L  =  1024 
IS  typical,  it  js  actually  very  easy  to  solve  for  ?  by  a  simple  bisection 
searA  over  0.  Although  this  method  U  similar  to  that  developed  in 
p],  here  an  exact  solution  to  the  max-likelihood  problem  is  possible 
because  the  data  is  taken  to  be  the  asymptotic  prediction  error 
correUUon  coefficents,  i.e.,  infiniit  data.  Clearly  errors  wiU  be 
mtroduced  in  cornputing  the  coefficients  using  finHt  kngih  data. 
This  requires  further  analysis. 

Having  found  the  estimate  an  estimate  for  foUows  from 
the  expression  for  cq  i  that  is; 


—  Co  —  Au 


1  - 


(36) 


As  a  result,  there  holds 
^SCk 


;fereiiD(A,) 


I  0. 


(28) 


(29) 


1  —  p'‘  '  ' 

Hence,  the  end  «^t  of  the  max-likelihood/stochastic  embedding 
approadi  is  the  following  ARX  model  set:  ^ 

Ay  =  Bu  +  Au  -k  u» 


fc>l 


M' 


ARX 


Smee  there  is  no  information  in  the  mean  regarding  the  pulse  re¬ 
ponse  decay,  it  is  necessary  to  compute  the  variance.  Thus, 

.  Vfc  >  1  (30) 

^  ^  information  we  seek 

appears  only  in  those  coefficients.  Observe  that  the  model  error  has 
been  captured  by  a  parametric  stochastic  description  with  just  two 
free  parameters  A^  and  p. 

There  are  now  several  ways  to  fit  the  sample  variance  to  this 
expression  and  find  values  for  A,  and  p  which  achieve  the  best  fit 
One  approach  to  estimating  these  parameters  is  to  fit  a  gaussian 

^tnbution  to  the  TOirelation  coefficients  cfc,  k  >  1.  That  is,  assume 
^t  fc*  I  A:  _  1  :  A}  is  anmdependentgaussiansequencewithzero 

mean  and  variance  a/3*  where  0  <  /?  <  1.  Thus, 


-=E 

kzzl 


(37) 


9*  €  IID(As) 

S(wt)^  z=  Au, 


(31) 


3=0^  « — 

md  .Ktiimtes  for  A«  and  p  can  be  obt^ed  from  estimates  of  a  and 
p.  inat  IS,  given  ot  and  (3^ 

,>=VS,  =  (32) 

Taking  a  max-likelihood  approach,  the  negative  log  likelihood  func- 
tion  is  then; 

L 

^  ~  2^  E  j  logo  -k  f).  log/?  -k  constant  (33) 

Jtssl  ^ 

The  minimizing  V2Llues  a  and  j?  satisfy: 


*=i 


(34) 


(35) 


6  Robust  Control  Analysis 

In  this  section  we  discuss  the  issue  of  robust  control  design  un- 
der  the  that  the  true  system  is  in  the  ARX  model  set 

tt^RX^of  (37).  Following  the  dUcussion  in  section  ,  the  control 

“  u  stabilize  the  nominal  ARX  system  Ay  =  Bu  -k  e 

nm  .ubJiz,  PU,.,.  m  1*  th,  do„d-Jp 

_  ■-Er.'s.w.. 

is  stable  where  Q  =  K/(l  +  GK)  as  before  and  now 

^  hfc(z)  =  Q(z)?‘-tz-fc  (39) 

^Iwmv  (^«).  stability  is  meant  again  as  a  “probability  of 

^  ““''rtiun  parameters  are  bounded  in 

gmtude,  rather  than  m  an  ellipsoid  as  before.  SpecificaUy,  since 

Prob{sup9j  <  «(»j)A}  =  t) 

it  follows  that 

Prob{(l  -  ^  h*(;z)9*)-t  stable}  >  „ 

*=1 

In  this  case  the  stability  margin  is  defined  as: 

r(w)»  £A*(e^“'),*  =  l  1  (40) 

,  ,  '  fc=l  J 

Although  tWs  is  an  iiifinite  dimensional  convex  programming  prob¬ 
lem,  nonetheless  a  solution  is  readily  obtained  [10]. 


^•tib 


c  vAic  BLAouiLy  ma 

—  inf  inf  <  g?  =  i 
W  r(w)  I  * 


7  Concluding  Discussion 

Two  least-squares  based  methods  have  been  presented  for  ob¬ 
taining  ARX  model  sets.  The  first  is  obtained  using  properties  of 
high-order  ARX  models  and  the  second  uses  a  stochastic  embedding 
scheme  on  the  residuals  from  an  ARX  model  of  2tny  order.  Various 
asymptotic  approximations  are  used  in  obtaining  either  set,  and 
their  effect  is  under  study.  Either  of  the  ARX  model  sets  is  useful 
for  robust  control  of  systems  with  uncert^dn  paramaters.  However, 
in  this  case  the  uncertainty  is  random  and  so  a  notion  of  probability 
of  stability  is  introduced.  In  other  unpublished  work  of  the  authors, 
methods  are  being  developed  for  analyzing  the  more  practical  situ¬ 
ation  where  the  data  set  is  finite. 

In  this  paper  only  some  of  the  many  properties  of  least-squares 
parameter  estimation  with  ARX  models  have  been  examined.  There 
are  meiny  properties  of  lecLst- squares  estimation  that  may  yet  be 
profitable  for  obtaining  set  estimators  useful  for  robust  control  de¬ 
sign.  For  example: 

1.  The  least-squ€iures  solution  is  unique  and  provides  the  global 
minimiim  in  closed-form,  which  in  the  ideal  case  allows  for  a 
complete  asymptotic  statistical  analysis. 

2.  By  utilizing  “square-root”  numerical  algorithms,  i.c.,  SVD  or 
QR  algorithms,  the  solution  can  be  efficiently  and  rapidly  com¬ 
puted  even  for  very  high  orders.  In  particular,  with  the  QR 
method: 

(a)  all  models  are  simultaneously  computed  up  to  a  speci¬ 
fied  maximum  order  and  stored  in  a  square-root  matrix 
whose  size  is  determined  by  the  majcimum  model  order, 
not  data  length. 

(b)  prediction  error  variamce  vs.  model  order  (up  to  the 
maximum)  is  available  as  the  last  “column”  of  QR. 

(c)  results  of  different  experiments  are  easily  joined,  t.c., 
there  is  no  need  to  re-compute  over  both  data  sets. 

(d)  high-order  models  are  easily  computed;  very  rapid  com¬ 
putation  is  possible  with  lattice  algorithms. 

3.  The  impulse  response  coefficients,  up  to  the  order  of  the  nu¬ 
merator  polynomi2d,  are  eisymptotically  unbiased. 

4.  The  estimates  have  orthogonality  properties  which  are  not  de¬ 
pendent  on  assiunptions  about  the  character  of  the  uncertain 
disturbance,  e.g.^  gausssiein,  worst-case,  and  so  on. 

Propery  3  is  quite  appealing.  Specifically,  if  the  input  u  is 
white,  then  the  first  m  impulse  response  coefficients  of  the  true 
input /output  transfer  function  G  are  asymptotically  unbiased.  Re¬ 
call  that  m  is  the  order  of  B©,  the  nmnerator  polynomial  in  the 
ARX  model  set.  It  is  interesting  that  the  result  does  not  depend  on 
n,  the  order  of  the  ARX  denominator  polynomial.  Moreover, 
the  true  system  need  not  be  in  the  ARX  model  set! 

This  result  was  originaly  proven  in  [9].  Other  useful  results  fol¬ 
lowing  from  this  fact  can  be  found  in  [l].  To  summarize  the  result. 


arg  min  S(yt  —  0^<f>t)^ 

(41) 

sent*’ 

1 

—  00 

(42) 

Thus  {jj  1 52 » •  •  •}  pulse  response  sequence  associated  with 

the  asymptotic  transfer  function  estimate  of  G  in  (1)  which  has  the 
pulse  response  sequence  {51,521  •  •  That  is, 

fc=i 


The  result  from  [9]  is  as  follows: 

u€lID(Au)=>5fc  VA:  =  l:m  (43) 

This  result  together  with  the  use  of  an  appropriate  affine  model 
exp^lnsion  (c.5.,  Laguerre  or  Kautz)  may  prove  most  beneficial.  An 
affine  model  set,  c.5.,  a  Laguerre  expansion  for  G,  can  offset  the 
issue  of  determining  what  is  meant  by  a  large  data  length.  More¬ 
over,  with  this  model,  it  is  possible  to  precisely  compute  statistical 
properties  for  any  given  model  order  or  data  length  -  no  asymp¬ 
totic  assumptions  are  required,  c.5.,  [6].  The  stochastic  embedding 
scheme  developed  in  [3]  uses  such  an  approach,  and  if  applied  as 
proposed  here,  it  may  be  possible  to  eliminate  all  the  asymptotic 
requirements,  t.c.,  large  data  length  or  high  model  order. 
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Uncertainty  Model  Unfalsification;  A  System  Identification  Paradigm 
Compatible  with  Robust  Control  Design 

Robert  L.  Kosut*'^ 


Abstract  To  falsify  or  unfalsify  -  that  is  the  question. 

It  is  shown  that  unfcilsification  of  the  standard  robust  control  de¬ 
sign  uncertednty  model  is  a  natural  replacement  for  system  identifi¬ 
cation  when  the  intended  xise  of  the  model  is  robust  control  design. 
For  the  ARX  model,  the  unfalsification  step  requires  solving  a  set  of 
convex  programming  problems,  specifically  LMI  problems,  of  which 
ordinary  lecist- squares  is  one  member.  The  result  is  a  tradeoff  curve 
between  model  imcertainty  and  disturbance  uncertainty.  Hence,  a 
family  of  models  are  unfalsified  from  the  data  record.  The  tradeoff 
curve  is  given  a  frequency  domain  interpretation  via  the  DFT  and 
related  computational  issues  are  discussed. 

Introduction 

Traditional  system  identification  must  be  modified  in  order 
to  be  compatible  with  robust  control  design.  The  nature  of 
the  modification  arises  from  two  fundamental  principals  un¬ 
derlying  robust  control  design:  (1)  models  are  inaccurate,  and 
(2)  a  good  control  design  can  withstand  model  inaccuracy. 
Hence,  robust  control  design  does  not  require  a  unique  model. 
But  traditional  identification  extracts  a  unique  model  from 
the  data,  i.e.,  as  described  by  Ljung  [11],  a  criterion  is  posed 
and  minimized  over  a  set  of  candidate  models.  Unfortunately, 
there  is  no  means  to  provide  a  measure  of  the  identified  model 
uncertainty,  t.e.,  bounds  on  model  error  and  disturbances. 

In  contrast  to  system  identification,  a  model  is  said  to  be 
validated  if  and  only  if  it  could  have  produced  the  data.  Val¬ 
idation  is  perhaps  a  misnomer,  as  one  can  never  prove  that 
a  model  will  be  able  to  accurately  predict  the  future.  How¬ 
ever,  the  data  can  falsify  a  model,  i.e.,  the  model  may  prove 
to  be  incapable  of  producing  the  data.  Hence,  a  precise  term 
is  unfalsification. 

Clearly  unfalsification  is  a  feasibility  problem  -  find  a 
member  of  the  model  set  that  is  consistent  with  the  data. 
Identification  poses  an  optimization  problem,  e.g.j  pick  the 
unfalsified  model  that  has  the  smallest  noise  variance.  Why? 
It  is  certainly  possible  that  some  other  unfalsified  model  would 
produce  better  closed-loop  performance. 

In  consequence,  for  robust  control  design,  it  is  logical  to 
replace  system  identification  with  unfalsification.  In  addition, 
postulate  a  model  set  suitable  for  robust  control,  t.e.,  an  un¬ 
certain  fy  model  which  includes  both  disturbance  and  dynamic 
uncertainty  descriptors.  Thus,  in  summary,  two  changes  must 
be  made  in  the  system  identification  process: 

1.  postulate  a  candidate  model  set  which  includes  descrip¬ 
tors  for  both  disturbance  and  dynamic  uncertainty 

2.  use  the  data  to  produce  all  unfalsified  models  in  the  can¬ 
didate  model  set. 
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Having  produced  a  set  of  unfalsified  uncertainty  models^  the  re¬ 
maining  question  is  how  to  select  an  appropriate  model.  Since 
the  ultimate  goal  is  closed-loop  design,  we  are  naturally  led  to 
evaluating  each  unfalsified  uncertainty  model  by  designing  and 
implementing  a  robust  controller  based  on  the  model.  Thus, 
we  have  the  following  iterative  scheme: 

1.  Find  all  the  unf2Llsified  uncertzdnty  models  corresponding 
to  the  current  data  set. 

2.  Test  each  unfalsified  model,  in  some  order,  by  applying  a 
robust  controller  based  on  a  particular  unfalsified  uncer¬ 
tainty  model. 

3.  Those  controllers  that  yield  performance  better  than  or 
equal  to  that  predicted  do  not  falsify  the  underlying  de¬ 
sign  model.  As  soon  as  performance  is  worse  than  pre¬ 
dicted,  the  underlying  model  is  falsified.  The  unfalsifica¬ 
tion  process  can  then  be  repeated  with  this  new  data. 

Hence,  actual  measured  data  is  used  to  rank  the  models  via 
the  effect  of  a  corresponding  robust  control  design. 

This  paper  principally  focuses  on  the  unfalsification  step 
with  a  brief  discussion  of  a  particular  iterative  scheme  which 
follows  the  broad  outline  described  above. 

Previous  Research  Several  approaches  have  been  put 
forward  which  involve  iterating  between  identification  and 
control  design,  e.y.,  [15,  8,  20,  7].  A  good  survey  is  provided  by 
Gevers  [4].  A  fundamental  difficulty  with  iterative  approaches 
is  that  in  order  to  prove  (analytically)  that  the  iteration  will 
work  requires  solving  very  difficult  fixed-point  problems,  e.p., 
[!]• 

The  framework  for  uncertainty  model  unfalsification  and  ro¬ 
bust  iterative  control  design  presented  here,  which  avoids  some 
of  the  difficulties  encountered  with  many  iterative  schemes, 
was  inspired  by  the  work  of  PooUa  et  al.  [13].  This  approach 
is  also  heralded  in  the  short  essay  by  Dahleh  and  Doyle  [3], 
with  further  development  in  the  paper  by  Livestone  et  al.  [9] 
and  dissertation  [10].  The  uncertainty  tradeoff  curves  derived 
here  was  first  presented  in  [7],  but  at  that  time  there  was  no 
framework  upon  which  to  provide  a  raison  d^efrefor  the  result. 

These  approaches  differ  from  that  proposed  by  Safonov  et 
al.  [17,  18,  14]  where  a  controller  set  is  postualted  and  mem¬ 
bers  are  unfalsified  directly  from  input/output  data.  In  a 
sense,  no  model  is  required  except  that  implied  by  the  per¬ 
formance  criterion.  In  addition,  controllers  can  be  falsified 
without  having  to  be  implemented.  In  model  unfalsification  as 
described  here,  a  natural  ordering  of  models  is  provided  based 
on  model  error  magnitude.  Thus,  a  combination  of  model  and 
controller  falsification  may  prove  to  be  the  most  practical. 

Organization  of  Paper  The  first  section  provides  a  brief 
review  of  classical  least-squares  prediction  error  methods  of 


system  identification.  This  motivates  uncertainty  modeling 
and  the  subsequent  unfalsification  procedure  which  links  iden¬ 
tification  and  robust  control  in  a  natural  iteration.  Frequency 
domain  interpretations  are  presented  which  provide  a  means 
for  efficient  computation. 

Prediction  Error  (PE)  Identification 

This  section  provides  a  brief  review  of  the  classical  para- 
metric  prediction  error  method  of  system  identification.  The 
formulation  and  notation  closely  follows  the  presentation  in 
the  textbook  by  Ljung  [11]. 

Data  and  True  System  Let  and  denote,  repec- 
tively,  input  and  output  data  which  has  been  recorded  at  the 
N  integer  sample  times  t  =  1  :  jv.  Thus, 

(1) 

For  purposes  of  analysis,  we  make  the  following  assumptions 
about  the  true,  but  unknown  system,  which  generated  the 
data: 

•  The  true  system  is  stable  and  operating  in  open  loop, 
with  input/output  behavior  described  by, 

y  =  Gu  +  V  (2) 

where  G  is  a  causal,  stable,  linear-time-invariant  system 
with  unknown  (zero-order-hold)  transfer  function  G{z), 
and  u  is  a  disturbance  with  spectrum  Svv{t^)  which  is 
independent  of  u,  t.e.,  =  0.  It  is  also  assumed 

that  u  has  spectrum  5txu(t^). 

The  identification  problem  is  to  map  the  finite  data  (1)  into 
estimates  of  G{z)  and  5vv(w),  with  the  additional  constraint 
that  we  want  the  estimates  to  be  “good  enough”  for  closed- 
loop  control  design.  The  first  step  is  to  pick  a  candidate  model 
set  which  might  be  able  to  explain  the  data. 

Prediction  Error  (PE)  Model  Classical  prediction  er¬ 
ror  identification  postulates  the  model  set: 

y  =  Geu^Hew,  OeO,  tn€Wwht(<T)  (3) 

where  w  is  an  unpredictable  sequence  drawn  from  the  set 
Wwht(of)  consisting  of  white  noise  sequences  with  variance  <r^y 
Ge  and  He  are  causal,  linear- time-invariant  systems,  initially 
at  rest,  with  transfer  functions  Ge{^z)  and  He{z\  respectively, 
and  are  dependent  on  a  parameter  vector  B  in  the  set, 

e={eeli^  |  LT-'  and  n^^Ge  are  stable  }  (4) 

As  explained  in  [11],  parameters  in  the  above  set  insure  that 
the  predictor  associated  with  the  model  set  is  stable.  Specifi¬ 
cally,  the  corresponding  one-step  ahead  prediction  error  is, 

ee-  Hg^iy^Geu)  (5) 

Hence,  ^  €  0  implies  that  (j/, «)  — »■  e©  is  a  stable  system. 
Moreover,  if  the  data  (1)  was  actually  generated  from  (3), 
then  ee{t)  — ►  w{t)  as  <  — oo. 

It  is  often  the  case  that  the  prediction  error  is  modified  by 
a  data  filter,  F(z),  that  is,  ee  is  replaced  by  Fee.  Typically, 
E  is  a  (stable  and  minimum  phase)  pass-band  filter.  As  will 
be  seen,  the  data  filter  is  particularly  important  for  indirectly 
manipulating  model  error  (bias). 


Observe  that  the  data  filter  could  be  considered  part  of  the 
disturbance  model  by  replacing  He  with  F^^He.  Hence,  an 
alternative  form  of  the  model  set  (3)  is: 

y  =  Ggu  +  F-^  Hew,  e  €  &,  w  €  Wwht(o-)  (6) 

which  leads  to  the  (filtered)  prediction  error 

ee  =  FHQ^(y  -  Geu)  (7) 

The  filtered  model  (6)  will  be  used  throughout  the  remainder 
of  the  paper  and  referred  to  as  the  prediction  error  model, 
abbreviated  by  PE  model. 

Least- Squares  Identification  System  identification,  in 
the  prediction  error  context,  usually  means  performing  the 
following  ‘^east-squares”  optimization: 

N 

On  =  (8) 

where  e$(t^  is  an  element  of  the  prediction  error  data  sequence, 

^0  =  {e.(t)}^l,  =  FHe\y^  -  Geu^)  (9) 

When  the  true  system  is  in  the  postulated  model  set,  the  op¬ 
timal  error  spectrum  is  constant  and  equal  to  the  variance 
of  the  white  noise  sequence.  When  the  true  system  is  not 
in  the  model  set  —  the  usual  case  in  practice  —there  is  no 
true  parameter  and  the  estimated  transfer  function  has  a  “bi¬ 
ased”  frequency  response,  t.e.,  there  is  no  ^  G  0  such  that 
Gie^^^)  -  ^^(e^-)  =  0  for  all  a; 

As  shown  in  [11],  the  mean-square  model  error  can  be  com¬ 
puted  asymptotically  for  large  data  length  n  and  large  model 
order  p.  Specifically,  for  large  n,  large  p,  and  small  p/n, 

E \G{en  -  ~  S.  Mid 

^  Ouu(w) 

where  E  denotes  expectation  with  respect  to  the  noise  v  in  (2). 
However,  a  quantification  of  “large”  or  “smaU”  is  not  known 
and  thus  the  above  expression  cannot  be  used  to  reliabb'  pre- 
diet  model  error. 

AR.X  Model  Structure  A  practical  difficulty  with  using 
the  general  model  structure  (6),  is  that  there  is  no  guaranty 
that  the  least-squares  criterion  has  a  unique  global  minim  urn, 
although  in  most  cases  standard  descent  methods  of  optimiza¬ 
tion  will  attain  a  local  minimum.  Partly  because  of  this  rea¬ 
son,  the  most  widely  used  model  structure  is  the  ARX  model 
where: 


Ge{z) 

=  Be{z)IAe{z),  He{z) 

=  1/Ae{z) 

Ae{z) 

—  l-t-OiZ 

■”A 

Beiz) 

=  6,2-'  -f  + 

e 

=  [“1  6i  •••6„^]^ 

€R'’,  p  = 

TIA  +  UB 

Hence,  the 

standard  ARX  model  is: 

Aey  = 

'  Beu  -f-  F~^w,  0  g  R**, 

w  £  Wwhi 

(a)  (10) 

with  the  corresponding  prediction  error, 

ee  =  F(Aey  —  Be 

'«) 

(H) 

which  is  affine  in  0.  Thus,  least  squares  optimization  is  con¬ 
vex,  and  the  minimizing  0  is  readily  and  efficiently  obtained 
numerically  using  SVD,  OR,  or  lattice  methods. 


Modeling  for  Robust  Control 

The  type  of  model  used  for  robust  control  design,  referred 
to  as  an  uncertainty  model  incorporates  an  additional  term 
which  accounts  for  a  class  of  dynamic  uncertainty. 

PE  Uncertainty  Model  Specifically,  the  PE  model  (6) 
is  modified  as  follows  to  obtain  the  uncertainty  model: 

y  =  GeU’\-F~^ He{w-\-Au)y  ^  G  0,  tJt;GW((T),  A  G  A(7) 

(12) 

where  W((t)  is  a  set  of  bounded  sequences  with  bounding 
parameter  cr  -  not  necessarily  white  noise  sequences.  Observe 
that  now  the  model  includes  a  dynamic  uncertainty  set,  A(7), 
with  7  denoting  a  system  bound.  The  most  common  set  used 
for  robust  control  design  is  the  set  of  Ho©  bounded  systems, 

A(7)  =  {A(0)  I  ||A||^  <7}  (13) 

with  the  corresponding  prediction  error, 

ee  =  FHQ^{y  —  Geu)  =  11;  +  Au  (14) 

In  addition  to  the  “random”  component,  w  G  W(<t),  the  pre¬ 
diction  error  also  has  a  “bias,”  or  dynamic  uncertainty  com¬ 
ponent,  A  G  A  (7).  In  particular,  with  the  true  system  given 
by  (2),  w  and  A  are, 

w  =  A  =  FHe\G  -  Ge)  (15) 

In  the  ensuing  discussion,  let  W((t)  be  the  set  of  sequences 
with  bounded  RMS,  i.e., 

W(<t)  =  {w  I  <  <7  }  ,  ||u>||^„,  :=  lim  ^  X^«’(<)^ 

'  iV— >00  N  ' 

t=l 

(16) 

When  the  parametric  model  has  the  ARX  structure,  the 
uncertainty  model  becomes: 

Aey  =  Beu  +  F~'^{w  +  Au),  ||w||,„s  <  <^,  ||A||^  <  7  (17) 

with  6  G  and  the  corresponding  prediction  error, 

ee  =  F{Aey  —  Beu)  =  ty  +  Au  (18) 

If  the  true  system  is  given  by  (2),  then  w  and  A  become, 

w  =  FAev,  A=:  F{AeG-  Be)  (19) 

Observe  that  if  F  is  a  lowpass  filter,  then  outside  the  filter 
bandwidth  the  uncertainty  can  be  very  large,  whereas  inside 
the  filter  bandwidth  the  uncertainty  could  be  modest. 

Unfalsification 

Given  some  input /output  data,  the  problem  is  to  find  a 
consistent  set  of  uncertainty  model  parameters  (^,  7,  <j).  This 
problem  is  similar  to  the  uncertainty  model  validation  problem 
addressed  by  PooUa  et  al.  [13].  In  the  case  investigated  there, 
the  uncertainty  model  parameters  are  known  and  the  uncer¬ 
tainty  model  is  validated  (unfalsified)  from  data.  We  use  the 
same  theory  here  (in  reverse)  to  estimate  the  parameters. 

We  first  define  the  unfalsification  problem  for  the  PE  un¬ 
certainty  model  (12),  and  then  specicdize  to  the  ARX  case. 

•  PE  Uncertainty  Model  Unfalsification 


The  uncertainty  model  (12) 

y==Geu+F-^He{wAAu),  6  e  &,  HUs  <  <^.  II^IL  <  7 

is  said  to  be  unfalsified  by  the  jv-point  data  sequences 

(1). 

=  {«(<)}f=: 

if  and  only  if  there  exists  ^  G  0,  an  N-point  sequence 
=  {ty(t)}|^i,  and  a  system  A,  such  that: 

Halloo  ^  ^  (20) 

and 

^  (21) 

where  Cq  is  the  N-point  prediction  error  data  sequence, 

e^  ^F He  -Geu^)  (22) 

The  key  for  establishing  whether  an  uncertainty  model  is  un¬ 
falsified  is  to  prove  (20).  For  LTI  systems  we  have  the  following 
result  which  is  stated  in  [13]. 

•  LTI  Unfalsification  Given  Jv-point  sequences  = 

and  =  {^(t)}^!,  there  e.xists  a  stable, 
causal,  LTI  system  such  that 

Halloo  ^  7 

if  and  only  if  (23) 

T{z^fr{z'^)  <  7^t(«^)^t(«^) 

where  T(-)  maps  an  Jv-point  sequence  into  an  iv  x  n  lower 
triangular  Toeplitz  matrix,  e.^., 


u(l)  0 

0  - 

II 

u(2)  u(l) 

0 

(24) 

1 

.  w(n)  u{n  —  1)  •  - 

’•  "(1)  - 

Applying  the  above  result  yields  the  following: 

•  PE  Uncertainty  Model  Unfalsification  The  PE  un¬ 
certainty  model 

y  =  Ge‘ii-\-F~'^He{'w+Au),  0  €  0,  ll^t'llrms  ^ ll'^lloo  ^  7 

is  unfalsified  by  the  N-point  data  sequences  {y^  ^u^)  if 
and  only  if  there  exists  ^  G  0  and  an  jv-poiiit  sequence 
=  {^(l)j  •  •  •  >  tjw(jv)}  such  that: 

T(ef  -  w^)'^T{e^  -  w^)  < 

(25) 

~vec(tt;'^)^vec(ty^)  < 

From  (23),  the  first  inequality  (25)  is  necessary  and  sufficient 
for  the  existence  of  an  uncertain  dynamic  system  11A||^  <  7 
which  maps  -*  .  The  second  inequality  in  (25) 

is  (obviously)  necessary  and  sufficient  for  the  existence  of  an 
jv-point  disturbance  sequence  satisfying  ^ 


As  shown  in  the  recent  text  by  Boyd  et  al  [2],  the  inequal¬ 
ities  in  (25)  can  also  be  expressed  as  matrix  inequalities,  e.g., 

■  7  r(u^)"'T(u^) 

>  0 

(26) 

;^vec(tn^)^  ' 

>  0 

.  In 


Since  (28)  is  a  convex  optimization,  it  follows  that  the  tradeoff 
function,  is  convex  in  7.  The  model  uncertainty  bound, 
7,  can  range  between  the  two  obvious  extremes:  when  the 
prediction  error  is  due  only  to  the  disturbance  and  when  it  is 
due  only  to  the  dynamic  uncertainty. 

The  first  extreme  point  is  when  all  the  prediction  error  is 
assumed  to  be  due  to  the  disturbance,  t.e.,  when  7  =  0.  It 
follows  from  (28),  that  7  =  0  if  and  only  if  =  e^.  Hence, 
we  obtain  the  usual  least-squares/ ARX  solution: 

<r{0)  =  <7u  :=mm||eS'||^^^ 


For  the  general  PE  model  structure,  these  matrix  inequalities 
are  affine  in  the  unknown  variables  7  £  R+,  <7  €  R+,  and 
vec(ti;^)  G  R^,  but  not  in  the  parameter  ^  €  R^,  except  in 
the  ARX  case  which  we  will  discuss  in  the  next  section. 

Uncertainty  Tradeoff  Curve  It  is  clear  that  for  any 
given  value  of  7  there  is  a  minimum  value  of  tr,  the  disturbance 
bound,  which  is  consistent  with  the  data.  For  each  dynamic 
uncertainty  bound  7,  denote  this  value  by  cr(y).  The  graph 
of  <^(7)  versus  7  establishes  a  tradeoff  curve  between  model 
uncertainty,  7,  and  disturbance  uncertainty  <7.  The  curve  also 
separates  the  unfalsified  and  falsified  uncertainty  models  based 
on  the  current  data.  The  curve  <7(7)  will  be  referred  to  as  the 
unceretainty  tradeoff  curve. 

The  general  PE  uncertainty  tradeoff  curve  is  obtained  by 
solving  the  following  optimization  problem: 

•  Unfalsified  PE  uncertainty  Tradeoff  Curve 
Fix  7  and  perform  the  optimization: 

(t(7)  :=  min  cr,  subject  to  (26)  (27) 

6,  <7, 

Denote  the  minimizing  values  of  6  and  by  ^(7)  and 
tu^(7). 

Since  is  not  affine  in  0  for  the  general  PE  structure,  it 
follows  that  (27)  is  not  a  convex  optimization  and  the  tradeoff 
function,  <^(7),  is  therefore  not  necessarily  convex  in  7,  but  it 
is  piece^wise  continuous. 

The  range  of  7  values  must  also  be  determined.  We  wUl 
examine  the  specific  properties  of  the  tradeoff  curve  for  the 
ARX  uncertainty  model. 

ARX  Uncertainty  Model  The  above  result  applies  to 
unfalsification  of  the  ARX  uncertainty  model  (17)  with  ©  = 
RP.  However,  in  the  ARX  case,  is  affine  in  0.  Hence, 
the  matrix  inequalities  (26)  become  linear  matrix  inequalities 
(LMPs)  in  the  unknown  variables  7  £  R+,  <7  G  R-i-,  ^  € 
RP,  and  vec(u;^)  G  R^'.  Consequently,  to  find  those  (6»,  7,  (7) 
satisfying  (25)  requires  solving  a  convex  feasibility  problem 
[2]. 

The  ARX  uncertainty  tradeoff  curve  is  then  obtained  by 
solving  the  following  convex  optimization  problem: 

•  Unfalsified  ARX  uncertainty  Tradeoff  Curve 
Fix  7  and  perform  the  optimization: 

cr{y)  :=  min  cr,  subject  to  the  LMPs  in  (26)  (28) 

0,  <7, 

Denote  the  minimizing  values  of  6  and  by  ^(7)  and 

t?^(7). 


(29) 

e{0)  =  Su  :=argnim||e^||^^^ 

Consequently,  we  can  now  provide  yet  another  interpretation 
of  least-squares:  it  produces  the  minimum  variance  unfalsified 
uncertainty  model.  Thus,  no  ARX  model,  Aey  = 
can  reproduce  the  data  with  a  noise  level  smaller  than 

The  second  extreme  point  on  the  tradeoff  curve  is  when  all 
the  prediction  error  is  assumed  to  be  due  to  the  dynamic  un¬ 
certainty,  i.e.,  when  <7  =  0.  Let  <7(7)  =  0  where  7  is  obtained 
by  solving  (28)  with  =  0.  That  is, 

7  =  min  7,  subject  to  r(e^)^T(e^)  < 
cr,  7 

(30) 

It  is  not  always  the  case  that  a  solution  exists  for  <7  =  0.  As 
will  be  shown  later,  there  may  be  a  non-zero  lower  bound,  a, 
such  that  <7(7)  =  a-  for  7  >  7.  In  any  case,  the  unfalsified 
tradeoff  curve  is  given  more  explicitly  by: 

{^^(7)  I  TC  [0,7]}  (31) 

A  consequence  of  the  convexity  of  <7(7),  is  that  the  two  extreme 
points  establish  the  affine  upper  bound: 

<^(7)  <  d'(7)  :=  <7is  (1 -7/7),  0<7<7  (32) 

The  figure  below  depicts  a  typical  tradeoff  curve  separating 
unfaJsified  from  falsified  uncertcdnty  models. 


<7 


Frequency-Domain  Analysis 

The  time-domain  conditions  for  uncertainty  model  unfal¬ 
sification  can  be  transformed  into  the  frequency  domain  by 
application  of  the  discrete- Fourier-transform  (DFT).  This  wiU 
also  provide  further  insight  into  the  meaning  of  the  tradeoff 
curve  as  well  as  providing  an  efficient  means  for  computation. 
Discrete-Fourier-Tranform  (DFT)  Given  an  /^-point 
time-domain  sequence  =  {a:(t)}^i,  define  the  function 
Xn(lv)  by 

N 


(33) 


Xhe  DFT  of  x  is  defined  as  the  //-point  complex  sequence 

(34) 

where  the  //-frequencies,  a;„  =  n  =  1, at  are  re¬ 

ferred  to  as  the  DFT  frequencies.  The  time  sequence,  = 
is  completely  recoverable  from  the  DFT  via  the  in¬ 
verse  transform: 

N 


Relation  of  DFT  and  Toeplitz  Form  The  foUowing 
result  is  established  in  [5]; 

=  sup  |A’Ar(u;)|^  (36) 

where  Amax  denotes  the  maximum  eigenvalue.  Hence,  for  suf¬ 
ficiently  large  //, 

(^T(x^)’’T(x^))  «  max  |Xj^(w„)f  (37) 


LTI  Unfalsification  The  above  approximation  can  be  ap¬ 
plied  to  LTI  unfalsification  (23).  It  is  shown  in  [6]  that  for 
sufficiently  large  n,  the  data  sequences  and  unfalsify 
the  LTI  uncertainty  model  z^  =  Au^  with  ||A|1^  <  7  if  and 
only  if, 


max 

ne^^u 


^Ar(cJn) 


<  7, 


Afu  :=  {n  I  UN{u;n)  ^  0} 


(38) 


PE  Tradeoff  Curve  Let  Ee,N{iOn),  ^/^(wn),  yjv(wn),  and 
14^Ar(u7n),  denote  the  DFT’s,  respectively,  of  the  //-point  se¬ 
quences  ^  and  .  The  above  approximation  yield 

the  following  large- ;v  approximation  of  (25): 


max 


EB^N{u)n)  —  Wj^{u)n) 
U^iiOn) 


<7, 


N 

|l^//(t*^n)|^  <  <T^ 

n=l 


(39) 

Consequently,  the  large-N  approximation  of  the  PE  tradeoff 
curve  reduces  to: 


<1? 


+  ^  MFe.7^(w„)| 


Ee^Nj^n) 

UN{iOn) 


(40) 

Of  course  no  approximation  is  needed  for  computing  a’(O), 
which  is  precisely  the  least-squares  solution,  t.e.,  by  ParsevaPs 
Theorem, 


N 

(j(0)^  =  afg  =  |£’0,,,//(a;n)|^  (41) 


n=l 

In  addition,  the  large-//  approximation  of  the  value  of  7  for 
which  <7(7)  =  0,  7  from  (30),  is  given  by, 


7 


min  max 
S  neMu 


^e,Nii^n) 

UN{(^n) 


(42) 


and  the  corresponding  disturbance  bound  is  not  necessarily 
zero,  i.e., 


CTitf  «  min  ^  IEe,w(a.'„)|^ 

^  ntUu 


(43) 


ARX  Tradeoff  Curve  The  previous  expressions  are  of 
course  identical  in  form  for  the  ARX  uncertainty  model,  with 
the  notable  exception  that  all  the  indicated  optimizations  in¬ 
volve  convex  programming. 

Computational  Issues  The  DFT  approximations  reveal 
some  of  the  computational  issues.  First,  with  the  ARX  uncer¬ 
tainty  model  structure,  the  DFT  approximations  allow  very 
rapid  calculations  to  find  the  optimum  [6].  However,  it  is  well 
known  that  dividing  the  DFT  of  an  output  sequence  by  the 
DFT  of  an  input  sequence  is  a  very  noisy  estimate  of  the  sys¬ 
tem  frequency  response,  e.y.,  [11,  Ch.6].  But  this  is  precisely 
what  is  suggested  by  the  dynamic  uncertainty  constraint  in 
(39).  Hence,  even  though  the  calculations  are  rapid,  they 
could  give  quite  misleading  results.  There  are  several  ways 
to  reduce  the  poor  quality  of  this  estimate.  It  is  standard 
practice  to  replace  the  DFT  with  spectral  calculations,  which 
typically  involve  time  and  frequency  window  selection.  This 
brings  us  back  to  some  of  the  earlier  methods  for  caculating 
model  error,  e.y.,  [7].  We  wiU  not  pursue  this  important  prac¬ 
tical  aspect  any  further  here,  but  [12]  provides  the  framework 
under  which  the  spectral  calculations  would  fit  under  the  un¬ 
falsification  umbrella. 


Iterative  Unfalsified  Robust  Control 


In  this  section  we  provide  a  brief  discussion  on  how  to  se¬ 
lect  an  uncertainty  model  on  the  tradeoff  curve  which  is  suit¬ 
able  for  control  design.  Because  any  uncertainty  model  corre¬ 
sponding  to  a  point  along  the  tradeoff  curve  is  unfalsified  by 
the  current  data  record,  it  follows  that  a  corresponding  robust 
controller  is  also  unfalsified  by  the  current  data  record.  Hence, 
a  controller,  can  only  be  falsified  hy  a  new  data  record.  How¬ 
ever,  a  new  data  record  is  obtained  when  the  robust  controller 
is  implemented.  This  simple  observation  leads  immediately 
to  the  foUowing  natural  iterative  unfalsification  and  robust 
control  design  procedure: 

•  Unfalsification  Based  on  the  current  finite  data  record, 
solve  (28)  for  the  unfalsified  tradeoff  curve.  This  step  in 
the  iterative  procedure  replaces  the  identification  step. 

•  Robust  Control  Design  Corresponding  to  each  fea¬ 
sible  point,  (7,<t(7))  on  the  tradeoff  curve  (31),  design 
a  robust  controUer,  t.e.,  a  controller  which  attains  the 
best  closed-loop  performance  possible  for  the  given  uncer¬ 
tainty  model  parameters.  Hence,  a  family  of  unfalsified 
robust  controllers  are  obtained  which  are  parametrized  by 
7,  the  model  uncertainty  bound,  A  large  value  of  model 
uncertainty  wiU  yield  a  cautious  control^  and  conversely, 
a  small  value  of  uncertainty  wiU  yield  a  more  aggressive 
control.  Since  the  family  of  robust  controllers  are  based 
on  the  estimated  tradeoff  curve,  there  is  no  guaranty  that 
any  wiU  perform  as  predicted.  Hence,  the  controUers  can 
only  be  falsified  by  real-time  implementation.^ 

•  Real-Time  Control  Implementation  The  uncer¬ 
tainty  “knob,”  7,  aUows  the  unfalsified  controUers  to  be 
implemented  in  real-time  in  a  logical  order,  i.e.,  the  first 
controUer  implemented  should  have  a  large  uncertainty 

At  this  stage  the  methods  of  controUer  falsification  proposed 
by  Safonov  ei  al.  [14,  17,  18]  could  be  used  to  falsify  candidate 
controUers  before  they  are  implemented. 


setting.  If  the  actual  performance  is  better  than  the  pre¬ 
dicted  performance,  then  the  controller  (and  the  under¬ 
lying  uncertainty  model)  are  still  unfalsified.  In  this  case, 
the  controller  can  be  made  more  aggressive  by  gradually 
decreasing  the  uncertainty  level  until  either: 

—  a  desired  closed-loop  performance  is  achieved. 

—  actual  performance  is  worse  than  the  predicted  per¬ 
formance. 

If  the  first  condition  occurs,  controller  design  iterations 
cease  and  performance  is  still  monitored  until  the  sec¬ 
ond  condition  occurs,  if  ever.  If  the  second  condition 
occurs,  the  controller  (and  the  underlying  uncertainty 
model)  have  been  falsified  In  this  case,  the  closed-loop 
data  can  be  used  to  generate  a  new  estimate  of  the  un¬ 
falsified  family  of  tradeoff  curves.  If  these  new  curves  are 
not  significantly  different  than  the  previous  curves,  then 
more  and  more  data  can  be  used  with  perhaps  more  vig¬ 
orous  probing  signals.  If  this  fails,  then  the  uncertainty 
model  can  be  modified  in  a  number  of  ways,e.g.,  increase 
the  nominal  (ARX)  model  order,  reshape  or  change  the 
bandwidth  of  the  data  filter,  modify  input  spectra,  etc. 

It  is  certainly  conceivable  that  performance  cannot  be  further 
improved  because  the  limit  of  performance  has  been  achieved. 
But  this  cannot  be  known  without  knowing  the  system  in  the 
first  place.  It  is  also  possible  that  more  fundamental  changes 
must  be  made  to  the  uncertainty  model.  For  example,  im¬ 
posing  an  RMS  bound  on  the  disturbance  uncertainty  may  be 
too  restrictive.  This  can  be  relaxed  by  adding  the  constraint 
that  the  disturbance  is  uncorrelated  with  the  input  -  as  would 
be  the  case  for  stochastic  signals.  This  is  easily  added  to  the 
tradeoff  optimization  problem  via  the  convex  constraint: 

=  0 

The  disturbance  can  be  completely  altered  by  postulating  a 
set  of  disturbances  with  bounded  spectrum  (see  [12,  6])  and 
including  the  above  constraint  as  well. 

Stochastic  sets  are  also  possible,  and  again  these  would  lead 
to  convex  optimization  problems  where  now  there  would  be  a 
probability  tag”  associated  with  the  family  of  tradeoff  curves 
and  corresponding  robust  controllers. 

Fmally,  failing  all  else,  if  the  designer  is  “convinced”  that 
the  limit  of  performance  has  not  been  achieved,  then  it  is  pos¬ 
sible  that  the  underlying  linear  system  assumption  is  wrong. 
That  is,  a  linear  model  cannot  capture  some  nonlinear  phe¬ 
nomena  that  was  presumed  to  be  negligible,  but  is  actually  sig¬ 
nificant,  and  is  hindering  further  performance  improvement. 
In  this  case,  there  is  strong  motivation  to  re-think  the  physical 
modeling  assumptions  and  to  conduct  experiments  with  small 
signal  perturbations  about  nearby  operating  points  to  see  if 
the  tradeoff  curves  vary  significantly.  If  so,  this  also  suggests 
that  there  is  an  important  nonlinear  effect  which  should  be  ac¬ 
counted  for  directly  in  the  control  design, e.^.,  gain  scheduling, 
feedback  linearization,  etc. 
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Abstract  -  Unfalsification  replaces  system  identification  when  the  intended  use  of  the  model 
is  robust  control  design.  Unfalsification  results  in  a  family  of  uncertainty  models,  all  consistent 
with  the  data,  which  tradeoff  model  uncertainty  and  disturbance  uncertainty.  The  family  of 
unfalsified  models  is  used  in  an  iterative  approach  to  system  identification  and  robust  control 
design. 


1  Introduction 

“T/tc  test  of  all  knowledge  is  experiment.  Experiment 
is  the  sole  judge  of  scientific  ‘truth.’  But  what  is  the 
source  of  knowledge?  Where  do  the  laws  that  are  to  be 
tested  come  from?  Experiment,  itself,  helps  to  produce 
these  laws,  in  the  sense  that  it  gives  us  hints.  But  also 
needed  is  imagination  to  create  from  these  hints  the  great 
generalizations  —  to  guess  ...  2Uid  then  to  experiment  to 
check  again  whether  we  have  made  the  right  guess.” 

-  Richard  P.  Feynman  [3,  Vol.  I] 

Applying  Feynman’s  inspirational  description  of  the 
Scientific  Method  to  the  problem  of  finding  a  model  from 
data  gives  the  algorithm:  record  data,  pick  a  model,  vali¬ 
date  the  model,  if  the  model  is  no  good  pick  a  new  model, 
validate  the  new  model  with  new  data,  and  so  on.  It  is 
helpful  of  course,  as  Feynman  puts  it,  to  use  one’s  “imag¬ 
ination”  to  arrive  at  “the  great  generalizations.”  For  the 
problem  addressed  here,  this  means  pick  a  model  which  is 
sufficiently  accurate  for  use  in  control  design.  This  paper 
is  about  how  to  do  that. 

The  work  described  here  is  a  continuation  of  [6]  which 
addressed  the  problem  of  how  uncertainty  model  unfal¬ 
sification,  as  first  described  by  Poolla  ei  ai  [13],  could 
be  used  to  replace  the  system  identification  step  in  adap¬ 
tive  control.  In  this  paper  the  connection  between  ro¬ 
bust  control  and  unfalsification  is  made  more  precise.  An 
iterative  adaptive  robust  control  algorithm  is  presented 
which  incorporates  the  model  falsification  of  Dahleh  ei 
ai  [2,  9],  the  controller  falsification  approach  of  Safonov 
ei  ai  [14],  and  some  elements  of  the  “windsurfer”  adap¬ 
tation  devised  by  Anderson  and  Kosut  as  explained  in 
[7,  8].  Due  to  limited  space,  the  contributions  of  these 
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search,  Directorate  of  Mathematical  Computer  Sciences,  under 
Contract  No.  F49620-93-C-0012. 


and  other  references  will  be  cited  as  needed  in  the  text 
-  a  more  complete  listing  and  discussion  can  be  found  in 
[6],  which  is  briefly  reviewed  in  the  next  section. 

2  Uncertainty  Modeling 

The  starting  point  for  robust  control  design  is  an  un- 
ceriainiy  model  which  accounts  for  parametric,  dynamic, 
and  disturbance  uncertainty.  The  specific  form  consid¬ 
ered  here  (described  in  detail  in  [6])  is  referred  to  as  a 
prediciion  error  (PE)  unceriainiy  model: 

(  See 

y  =  GeU-hHB{w-^Au)  ^  lIHlrms  <  ^  (1) 

I  i|A|lH„  <  ^ 

where  y  and  u  are,  respectively,  the  observed  output  and 
input  sequences,  w  is  an  uncertain  sequence  rms-bounded 
by  <T,  A  is  an  uncertain  transfer  function  Hcx^ -bounded  by 
6,  Ge  and  He  are  causal,  linear-time-invariant  systems, 
initially  at  rest,  with  transfer  functions  Ge{z)  and  He{z)^ 
each  dependent  on  a  parameter  vector  6  in  the  set, 

0  =  {0  E  I  and  H^^Ge  are  stable}  (2) 

Parameters  in  0  insure  that  the  predictor  associated  with 
(1)  is  stable  [11],  t.e., 

ee  :=  Hs\y  -  Geu)  (3) 

Observe  that  the  PE  uncertainty  model  (1)  is  character¬ 
ized  by  ihree  types  of  parameters,  (^,  cr,  6)  --  the  standard 
PE  model  is  characterized  by  two  types  of  parameters, 
{6,  a). 

3  Unfalsification 

For  robust  control  design,  the  parameters  {d,  <r,  S)  are 
known.  In  some  C2ises  0  can  also  be  uncertain, i.e.,  known 


only  to  be  in  a  subset  of  0.  This  is  important  particu¬ 
larly  if  6  represents  uncertain  physical  parameters.  In  this 
paper  we  restrict  attention  to  the  problem  of  estimating 
(6,  cr,  from  data.  Hence,  0  are  used  here  as  canonical 
parameters  only  to  define  a  center  for  the  uncertainty  set. 

Classical  system  identification  is  not  the  solution.  It 
poses  an  optimization  problem  in  (^,^),  and  it  does  not 
deal  with  the  dynamic  uncertainty  ||A||jj  <  6,  which  is 

of  critical  importance  for  robust  control. 

In  contrast,  a  model  is  said  to  be  validated  if  and  only 
if  it  could  have  produced  the  data.  Validation  is  perhaps 
a  misnomer,  as  one  can  never  prove  that  a  model  will  be 
able  to  accurately  predict  the  future.  More  precisely,  and 
perhaps  closer  to  Feynmans  thoughts,  the  data  can  falsify 
a  model,  i.e,,  the  model  may  prove  to  be  incapable  of  fully 
explaining  the  experiment.  Hence,  instead  of  validation, 
we  use  the  more  precise,  but  awkward  term:  unfalsifica’ 
lion. 

Clearly  unfalsification  is  a  feasibility  problem  -  find  a 
the  model  set  whose  members  are  consistent  with  the 
data.  As  shown  in  [6,  13],  this  philosophical  shift  al¬ 
lows  the  dynamic  uncertainty  bound  6  to  be  estimated 
(unfalsified)  along  with  0  and  cr. 

Specifically,  let  and  denote,  respectively,  input 
and  output  data  which  have  been  recorded  at  N  integer 
sample  times  t  —  I  :  n,  that  is, 

(4) 

Let  denote  the  corresponding  prediction  error  se¬ 
quence,  i.e., 

=  -Geu^)  (5) 

Unfalsification  is  defined  as  follows  [6,  13]. 

•  Unfalsification 

The  uncertainty  model  (1)  is  said  to  be  unfalsified 
by  the  7V-point  data  sequences  (y^ ,  if  and  only 
if  there  exists  6,  A(z),  and  an  N-point  sequence  , 
such  that:  0  €  0,  llA||jj  <6,  <  <t,  and 

(1)  is  consistent  with  the'Sata,  i.e., 

=  w^  +  Au^  (6) 

The  following  data  dependent  test  of  unfalsification  is 
taken  from  [6],  and  is  derived  from  the  results  in  [13]. 

•  PE  Uncertainty  Model  Unfalsification  The  PE 

uncertainty  model  (1)  is  unfalsified  by  the  7V-point 
data  sequences  if  and  only  if  there  exists 

6  £  G  and  an  iv-point  sequence  such  that: 

T{e^  -w^)  <  6^T{u^fT(u^) 

(7) 

—veciw^^^vedw^)  < 


where  T(*)  maps  an  iv-point  sequence  into  an  x  iv 
lower  triangular  Toeplitz  matrix  ,  e.g., 


■  w(l) 

0 

0  * 

T(«^)  = 

«(2) 

u(l) 

0 

(8) 

[  u{n)  u{n  -  1)  • 

••  «(1). 

As  shown  in  [6] ,  an  uncertainty  tradeoff  curve  is  obtained 
by  solving  the  following  optimization  problem: 

♦  Unfalsified  Tradeoff  Curve 
Fix  5  and  perform  the  optimization: 

(t(6)  :=  min  a,  subject  to  (7)  (9) 

e,(X, 

Denote  the  minimizing  values  of  6  and  by  6{6) 
and  u;^(5). 

Remarks 

(1)  The  graph  of  cr{S)  versus  6,  referred  to  as  the  «n- 
certainty  tradeoff  curve,  establishes  the  tradeoff  between 
model  uncertainty,  8,  and  disturbance  uncertainty  a.  To 
every  point  on  the  tradeoff  curve  there  is  a  different  set 
of  nominal  transfer  functions  {G$,He)  because  6  =  ^(6), 

(2)  The  tradeoff  curve  separates  the  unfalsified  and  fal¬ 
sified  uncertainty  models  based  on  the  current  data.  (Fig¬ 
ure  1  shows  a  typical  tradeoff  curve  for  the  ARX  uncer¬ 
tainty  model).  Solutions  in  the  unfalsified  (feasibility) 
set  may  not  be  unique.  In  addition,  there  is  no  easy 
parametrization,  and  attempts  to  discretize  the  space 
may  lead  to  huge  dimensions  even  for  a  few  parameters. 

(3)  Let  {6,(t,5)  denote  the  parameters  of  the  uncer¬ 

tainty  model  (1)  satisfying  (7).  This  is  essentially  a  fea¬ 
sibility  set  because  all  that  is  required  is  the  existence  of 
(A,u;,0)  such  that  e^  =  w  +  Au^ ,  a{6)  <  <  <t 

and  ||A||ji^  <  6. 

(4)  The  model  uncertainty  bound,  6,  can  range  between 
the  two  extremes:  when  the  prediction  error  is  due  only 
to  the  disturbance  (^  =  0)  and  when  it  is  due  only  to  the 
dynamic  uncertainty  ((7  =  0).  In  the  latter  case, 

So  =  mm  8  (10) 

subject  to 

T{e^fT{e^)  < 

Unlike  the  whole  of  the  unfalsified  region,  uncertainty 
models  along  the  tradeoff  curve  are  easily  parametrized, 
i.e.,  0{S),cr{S)O  <  Sjq. 

(5)  For  the  special  case  when  6  =  0,  the  corresponding 
uncertainty  model  on  the  tradeoff  curve  defined  by  the 
parameter  triple  (^(0),  cr(0),  0)  is  precisely  the  usual  least- 
squares  prediction  error  transfer  function  estimate. 


<7 


(6)  In  general,  the  tradeoff  curve  is  not  convex  because 
cb  is  not  affine  in  6  except  in  the  ARX  case,  which  is 
described  below. 

(7)  Notice  that  no  mention  has  been  made  about  the 
“true”  system  which  generated  the  data  -  it  need  not 
be  linear.  All  that  is  claimed  is  that  there  exists  a  set 
of  linear  uncertainty  models^  each  of  which  could  have 
produced  the  data. 


ARX  Uncertainty  Model  The  ARX  model  is  the 
most  widely  used  model  structure  for  standard  system 
identification  [11].  In  this  case: 


Ge{z) 

Aeiz) 

Bb{z) 

6 


Bejz)  „  ,  .  _  I 
MzY  F{z)Mz) 

1  +  aiz~^  H - 1-  anZ~"'' 

biZ~^  +  •  • •  +  bnZ 

[ai---a„^  p  =  nA+nB 


where  F{z)  is  the  data  filter  included  as  part  of  the  noise 
model  He{z).  Hence,  the  (filtered)  ARX  prediction  error, 

ee  ==  FiAgy  -  Beu)  (11) 


Figure  1:  ARX  Tradeoff  Curve 


would  be  the  case  for  stochastic  signals.  These  are  easily 
incorporated  in  computing  the  tradeoff  (9)  by  adding  the 
linear  constraints: 


zero-mean: 


independence: 


=  0  (13) 

=  0  (14) 

N 


is  affine  in  0,  from  which  it  follows  that  0  =  R^,  the 
tradeoff  curve  cr(6)  is  convex  in  (5,  and  the  optimizing 
values  0(5)  and  w^{6)  are  unique.  Of  practical  impor¬ 
tance  is  that  the  optimization  problem  (9)  is  now  convex 
because  the  matrix  inequalities  (7)  become  linear  matrix 
inequalities  (LMFs)  in  the  unknown  variables  5  G  R-i-, 
a  G  R+,  0  G  R^,  and  vec(iy^)  G  R^.  As  shown  in  [1], 
(7)  can  also  be  expressed  as 


'  8  T{u^fT{u^)  T(e^  -  ‘ 

T(e^  —  w^)  8  In 


L  ;;;^vec(u;^)  (T  In 


>  0 


>0 


(12) 


This  expression  more  clearly  exposes  the  LMI  structure 
when  ee  is  affine  in  6. 


Alternately,  the  disturbance  can  be  completely  character¬ 
ized  by  bounding  it’s  spectrum  (see  [12]). 

Stochastic  dynamic  uncertainty  descriptions  as  in  [5] 
would  lead  to  convex  optimization  problems  where  now 
there  would  be  a  “probability  tag”  associated  with  the 
family  of  tradeoff  curves  and  corresponding  robust  con¬ 
trollers. 


4  Unfalsified  Robust  Control 

A  robust  control  can  be  designed  for  every  unfalsified 
uncertainty  model  and  then  a  new  closed-loop  experiment 
can  be  run  “to  check  again  ...”  But,  as  already  remarked, 
the  set  of  unfalsified  uncertainty  models  satisfying  (7)  is 
too  large  for  practical  purposes.  Therefore,  candidate  un¬ 
certainty  models  for  robust  control  design  will  be  chosen 
from  among  those  along  the  tradeoff  curve,  t.e.,  (1)  with 
the  parameter  set  {0(5),  (r(5)  |  0  <  5  <  5o}  found  from 
(9),  To  emphasize  that  the  candidate  uncertainty  models 
are  unfalsified,  as  well  as  simplifying  notation,  let 


ARX  Tradeoff  Curve  Figure  1  depicts  a  typical  ARX 
uncertainty  model  tradeoff  curve  separating  unfalsified 
from  falsified  uncertainty  models. 


Gunf  •=  G0(^s)i  ^unf  He(6) 
Let  r  denote  the  reference  command  and  let, 
u  =  Knnf{r  -  y) 


(15) 

(16) 


Additional  Constraints  Imposing  only  an  rms  bound 
on  the  disturbance  uncertainty  may  be  too  harsh,  result¬ 
ing  in  unrealistic  (malicious)  disturbances.  A  more  real¬ 
istic  situation  is  obtained  by  restricting  the  disturbance 
to  be  a  zero-mean  sequence  independent  of  the  input  -  as 


be  an  unfalsified  robust  controller^  that  is,  K^nf  stabilizes 
the  candidate  unfalsified  uncertainty  model.  Hence,  the 
closed-loop  transfer  functions, 


Tunf  •“ 


^unf-^unf 
1  "h  GunfJ^unf 


Qunf 


■^unf 


1  “h  ^unf 


(17) 
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5unf  •—  1  ~^unf)  ^unf-^^unfj  5unfGunf»  and  Qunf^^unf  are 
all  stable.  In  addition: 

^  IIQunf-^unfllH^  <  1  (18) 

Recall  that  this  condition  is  necessary  and  sufficient  to 
insure  stability  of  the  closed-loop  unfalsified  uncertainty 
model  for  all  ||A||jj^  <  6.  However,  it  is  neither  nec¬ 
essary  nor  sufficient  lor  determining  if  /Cynf  will  actually 
stabilize  the  true  system.  Recall  Feynman’s  words:  "Ex¬ 
periment  is  the  sole  judge  of  scientific  ‘truth.’  ”  So  until 
we  perform  an  experiment  with  /Cunf  as  the  controller, 
we  cannot  know  if  it  is  stabilizing  even  if  (18)  is  false. 
However,  there  is  a  sensible  order  in  which  to  experiment 
with  candidate  controllers,  and  as  shown  by  Safonov  ei 
al  [14],  some  of  them  can  be  eliminated  (falsified)  with¬ 
out  running  a  new  experiment.  We  will  return  to  this  in 
the  next  section. 

If  there  was  no  uncertainty  in  the  model  (tt;  =  0,  A  = 
0),  then  the  nominal  unfalsified  closed-loop  response 
would  be  Tunfr.  Non- zero  disturbance  and  dynamic  un¬ 
certainty  yields  the  unfalsified  closed-loop  performance  er¬ 
ror. 

Cunf  y  ^unf  ^  (f  ^) 

—  ( un  un  fA\  /_  *^unf  H  unf  ^ 

\  1  +  Qunf^unf^  /  \  1  -f  Qunfi/unf-^  / 

The  above  expression  can  be  thought  of  as  an  unfalsified 
uncertainty  model  of  the  closed-loop  system  performance 
error. 

Under  the  assumptions  that  uncertainty  (A,  w)  can  be 
drawn  from  the  sets  ||A||jj^  <  6  and  \\w\\^^^  <  cr(6), 
and  if  (18)  holds,  then  the  unfalsified  closed-loop  system 
satisfies  the  following  condition: 
Vn>landV||r«||^^3<oo, 

INunfllrms  —  «^unf  (Tunf  i  1"”) 

where 


tainty  model  associated  with  /<"unf  is  falsified  by  the  new 
data.  More  precisely: 

•  Model  Falsification 

Perform  a  closed- loop  experiment  with  control  /Cunf 
and  record  new  data  (y^pt,  r^pt),  and 

^expt  J^expt  ^unf  ^expt  (^1) 

If  3n  E  [1  :  n]  such  that: 

ll^exptllrjns  ^  *^unf  (^unf ,  T’expt)  (22) 

then  the  uncertainty  model  for  which  RTunf  has  been 
designed  is  falsified  by  the  new  data. 

Remarks 

(1)  A  more  informative  expression  signifying  (22)  is: 
the  controller  is  falsified  by  the  new  data  with  respect  to 
the  performance  criterion  (20). 

(2)  The  data  used  for  unfalsifying  the  uncertainty 
model  used  in  the  design  of  is  different  than  data 
taken  from  the  experiment  with  ATunf  in  the  loop.  Hence, 
if  (22)  holds  then  the  new  data  set,  which  has  falsified 
the  previously  unfalsified  model,  can  be  used  to  compute 
a  new  tradeoff  curve.  This  observation  suggests  an  itera¬ 
tive  procedure  for  adaptive  control  and  will  be  discussed 
in  a  later  section. 

(3)  It  is  possible  that  (22)  holds,  t.e.,  the  uncertainty 
model  is  falsified  by  the  new  data,  but  performance  sat¬ 
isfies  a  desired  specification.  For  example,  suppose  (22) 
holds  and  Vn  E  [1  :  at], 

ll^«xpt||nns  “  *^spec(^unf  >  ^expt)  (23) 


(20)  where 

«4pec(T,  r)  :=  7spec  ll^’"llrms  ^  ‘^spec  (24) 


^unf  (T,  r)  7unf  -h  aunf 

un  lE  un  f/[r„nf]  m  I 

l-6\Q 

un  f-^funfl  IlHoo 

Suni  -^unf 

1-^S\Q 

un  f-f^unfl  II 

where  [Tunfjm  is  the  minimum  phase  factor  of  Tunf  •  The 
same  caution  regarding  (18)  applies  here  as  well:  there 
is  no  guaranty  that  (20)  holds  when  a  closed-loop  exper¬ 
iment  is  run  with  ATunf- 

5  Uncertainty  Model  Falsification 

As  shown  in  [9],  if  (20)  does  not  hold  with  data  from 
a  new  closed- loop  experiment  with  ATunf,  then  the  uncer¬ 


7unf  • —  b 
^unf  ^(^) 


In  this  case  the  experiment  provides  a  proof  that  the 
controller  Kunf  meets  a  desired  specification,  at  least 
for  one  sequence  r^p^.  If  this  sequence  is  sufficiently 
rich  in  spectral  content,  it  is  highly  unlikely  that  further 
experimentation  is  required  to  falsify  ATunf  with  repect 
to  /spec(runf,r).  However,  a  different  unfalsified  model 
could  still  result  in  better  performance  -  this  has  not  been 
ruled  out. 

(4)  The  above  experimental  validation  (falsification)  of 
the  controller  A^unf  is  indirect,  because  the  controller  is 
designed  from  an  unfalsified  uncertainty  model.  This  can 
lead  to  a  possibly  wide  gap  between  unfalsified  predicted 
performance  and  actual  performance.  It  is  possible  to 
directly  falsify  controllers  without  having  to  try  them  out 
on  the  actual  system.  This  is  described  next. 


6  Controller  Falsification 


7  Iterative  Adaptive  Control 


In  a  series  of  papers  by  Safonov  ei  al,  (see  [14]  and 
the  references  therein),  it  is  shown  how  to  directly  fal¬ 
sify  a  candidate  controller  before  it  is  implemented.  This 
is  a  significant  result,  because  time  consuming  and  un¬ 
necessary  experimentation  is  eliminated.  The  procedure 
for  controller  falsification  is  essentially  the  same  as  that 
for  uncertainty  model  unfalsification,  but  applied  to  the 
controller.  What  is  different  is  that  the  “model”  now  be¬ 
comes  the  controller,  and  the  “uncertainty”  becomes  the 
closed-loop  specification. 

Specifically,  suppose  we  are  given  data  and 

want  to  know,  without  additional  experimentation^  if  the 
candidate  controller  /funf  will  fail  to  meet  a  desired 
closed-loop  specification.  A  simplified  version  of  the  pro¬ 
cedure  described  in  [14]  is  as  follows. 

♦  Controller  Falsification 

A  candidate  controller  ATunf?  designed  for  an  uncer¬ 
tainty  model  unfalsified  by  the  data  (t/^,  is  falsi¬ 
fied  with  repeat  to  performance  J spec (Tlinf  j  If 

satisfying 

=  Kunf{r^^-y^)  (25) 

and  3n  G  [1  :  n]  such  that, 

Ikbadlirms  ^  »^spec(Tunf  j  ^bad)  (25) 

with 

(27) 

Remarks 

Controller  falsification  is  a  feasibility  problem  in  the 
variable  Solving  for  from  (25)  gives, 

=  (28) 

Using  (17)  gives, 

^bad  =  5unf  (/"  -  Gunfw"^)  (29) 

Tunfr^ad  =  -  5unf(y'^  -  Gunfu"^)  (30) 

Observe  that  (Sunf,5unfGunf)  are  stable  by  construction 
(17),  hence,  are  bounded  even  though  ||r^adlLs 

could  be  very  large. 

It  can  be  shown  that  ebad  is  precisely  the  performance 
error  that  would  have  been  obtained  had  the  controller 

U  ~  Arunf(^bad  y)  (^^) 

been  applied  to  the  unfalsified  uncertainty  model  for 
which  Kunf  is  designed.  As  a  result,  using  (20), 

Ikbadllrms  ^  -^unf  (Tunf ,  T'bad),  (32) 

The  practical  consequence  is  this:  falsification  [with 
repect  to  Junf(Tiinf,  r)]  of  the  controller  (26)  implies  fal¬ 
sification  of  the  model  (22).  This  reinforces  the  notion 
expressed  by  Safonov  et  al.  [14]  that  direct  controller 
falsification  is  a  sharper  test  than  the  indirect  approach 
using  model  falsification  (22). 


The  preceding  discussion  of  model  and  controller  falsi¬ 
fication  suggests  a  “natural”  iterative  approach  to  adap¬ 
tive  control.  The  idea  is  to  verify,  ultimately  by  experi¬ 
ment,  the  performance  of  all  robust  controllers  designed 
for  the  family  of  unfalsified  uncertainty  models  along  the 
^-dependent  tradeoff  curve.  The  unfalsified  controllers 
should  be  either  falsified  or  implemented  in  decreasing 
order  of  6,  i.e.,  the  first  controller  implemented  should  be 
a  cautious  controller  with  a  large  uncertainty  setting.  . 
More  precisely,  the  control  can  be  iteratively  adapted  by 
adhering  to  the  following  steps: 

1.  initialize  performance  specification 

»^spec(7',  —  Tspec  llT’rH  rms  +  “spec 

2.  record  new  jv-point  batch  of  data 

(j^,u^,r^)  =  (y^,u^,r%.tch 

3.  compute  tradeoff  parameters 
use  (9)  with  (t/^,«^)  to  obtain: 

{0(6),  (t{6)  \  0<S<So} 

4.  start  with  cautious  controller 
set  6  =  6o 

5.  robust  control  design 

(a)  design  controller  Kunf 

(b)  compute  Junf  (Tlmf ,  t*)  from  (20) 

6.  controller  falsification 
if  (26)  holds,  z.e., 

llebadlirms  >  Jspec(Tunf }  ^bad) 

then  GO  TO  Step  10 

7.  new  experiment  &:  record  data 

(a)  implement  Kunf 

(b)  record  closed-loop  data 

8.  evaluate  performance 
if  (26)  holds,  t.e., 

Ikexptllrms  ^  «^spec(Tiinf ,  T^^pt) 

then: 

(a)  reset  to  last  controller  satisfying  Jspec 

(b)  replace  (y^.ti^.r^)  with  (y^,u^,r^)expt,  and 
GO  TO  3. 


9.  model  falsification 
if 

Ihexptlirms  ^ 

then  replace  (y^, «",  r^)  with  (y^, «^,  r^)expt,  and 
GO  TO  3 

else,  if  Jtinf  <  Jgpecj  then  replace  <7spec  with  «7unf- 

10.  decrease  6 

if  6  =  0,  GO  TO  Step  2 

else,  decrease  6  and  GO  TO  Step  5 

Remarks 

(1)  The  algorithm  falsifies  and/or  implements  con¬ 
trollers  along  a  tradeoff  curve,  and  when  these  are  ex¬ 
hausted,  a  new  batch  of  data  is  recorded  and  the  process 
is  repeated.  Controllers  are  implemented  only  when  they 
fail  to  be  falsified  by  current  data  with  respect  to  the  best 
observed  performance. 

(2)  In  each  new  experiment  the  reference  command 
^^pt  consist  of  a  nominal  sequence,  such  as  a  step 
or  square-wave,  plus  a  small  random  component  which 
could  be  added  to  the  test  reference  and/or  directly  to 
the  control  input.  Just  as  in  classical  system  identifica¬ 
tion,  a  sufficiently  rich  excitation  is  required  to  obtain 
useful  estimates,  e.g.,  [11,  4]. 

(3)  For  each  unfalsified  model  the  controller  can  be  de¬ 
signed  in  any  number  of  ways,  and  moreover,  there  are 
many  possible  robust  controllers.  For  example,  the  prob¬ 
lem  formulation  suggests  a  mixed  H2/H00 -optimization, 
but  that  involves  choices  of  weightings,  all  of  which  effect 
Tunf ,  the  nominal  response  to  a  reference.  A  more  direct 
approach  is  suggested  in  [7,  8]  where 

Qunf  =  TA/[Gunf]m  (33) 

where  [Gunf]m  denotes  the  minimum-phase  factor  of  Gunf , 
and  Tx  is  the  desired  (minimum  phase)  closed-loop  trans¬ 
fer  function  from  r  to  y  with  bandwidth  A.  The  conve¬ 
nience  of  this  approach  is  that  the  two  important  design 
‘"knobs”  are  directly  accessible,  namely,  model  error  S  and 
desired  closed-loop  bandwidth  A. 

(4)  A  large  value  of  model  uncertainty  6  together  with 
a  low  bandwidth  A  will  yield  a  cautious  control.  Con¬ 
versely,  a  small  value  of  uncertainty  8  together  with  a 
large  bandwidth  A  will  yield  a  more  aggressive  control. 
If,  in  addition,  Tx  is  used  as  the  data  filter  when  pro¬ 
ducing  the  tradeoff  models,  then  it  becomes  easier  to  link 
unfalsified  models  with  the  desired  performance. This  ad¬ 
ditional  degree  of  design  freedom  is  easily  added  to  the 
iterative  algorithm. 

Concluding  Remarks 

An  iterative  approach  to  robust  adaptive  control  has 
been  proposed  where  unfalsification  of  an  uncertainty 


model  replaces  classical  system  identification.  For  the 
ARX  model,  the  unfalsification  step  requires  solving  a  set 
of  convex  programming  problems,  specifically  LMI  prob¬ 
lems,  of  which  ordinary  least-squares  is  one  member.  The 
result  is  a  tradeoff  curve  between  model  uncertainty  and 
disturbance  uncertainty.  Hence,  a  family  of  models  are 
unfalsified  from  the  data  record.  A  natural  order  of  de¬ 
sign  is  to  start  with  a  cautious  controller  designed  for  an 
unfalsified  uncertainty  model  with  a  large  value  of  dy¬ 
namic  uncertainty.  As  the  model  uncertainty  is  slowly 
decreased,  actual  closed-loop  data  is  used  to  evaluate  (un¬ 
falsify  or  falsify)  the  model  and/or  the  controller. 

References 

[1]  S.  Boyd,  L.  Ei  Ghaoui,  E.  Feron,  and  V.  Balakrishnein,  Lin¬ 
ear  Matrix  Inequalities  in  System  and  Control  Theory,  SIAM 
Studies  in  Applied  Mathematics,  1994. 

[2]  M.A.  Dahleh  and  J.C.  Doyle,  “From  data  to  control”,  Proc. 
Workskoy  on  Modeling  of  Uncertainty  in  Control  Systems, 
Springer- Verlag,  1992. 

[3]  R.  P.  Feynman,  R.B.  Leighton,  and  M.  Sands  The  Feynman 
Lectures  on  Physics,  Addison- Wesley,  1963. 

[4]  M.  Gevers,  “Towards  a  joint  design  of  identification  and  con¬ 
trol,”  Essays  on  Control:  Persyectives  in  the  Theory  and  its 
Ayylications,  editors:  H.L.  Trentehnan  and  J.C.  Willems,  pp. 
111-151,  Birkhauser,  Boston,  MA,  1993. 

[5]  G.C.  Goodwin,  M.  Gevers,  and  B.  Ninness,  “Quantifying  the 
error  in  estimated  transfer  hmetions  with  applications  to  model 
order  selection,”  IEEE  Trans,  on  Automatic  Control,  vol.  37, 
no.7,  pp.  913-928,  July  1992. 

[6]  R.L.  Kosut,  “Uncertainty  model  unfalsification;  a  system  iden¬ 
tification  paradigm  compatible  with  robust  control  design,” 
Proc.  1995  CDC,  New  Orleans,  LA,  Dec.  1995. 

[7]  W.S.  Lee,  B.D.O.  Anderson,  R.L.  Kosut,  and  I.M.Y.  Mareels, 
“A  new  approach  to  adaptive  robust  control,”  Ini,  J.  of  Aday- 
tive  Control  and  Signal  Processing,  vol.  7,  pp.  183-211,  1993. 

[8]  W.S.  Lee,  B.D.O.  Anderson,  I.M.Y.  Mareels,  and  R.L.  Ko¬ 
sut,  “On  some  key  issues  in  the  windsurfer  approach  to  robust 
adaptive  control” ,  Automaiica,  Vol.  31,  No.  11,  pp.  1619-1636 
1995. 

[9]  M.M.  Livestone,  M.A.  Dahleh,  and  J.A.  Farrell,  “A  framework 
for  robust  control  based  model  invalidation” ,  Proc.  1 994.  A  CC, 
pp.  3017-3020,  B^lltimore,  MD,  June  1994. 

[10]  M.M.  Livestone,  Identification,  Robust  Adaytation,  and  Itera¬ 
tive  Schemes,  Ph.D.  dissertation,  MIT  Report  LIDS-TH-2273 
Oct.  1994. 

[11]  L.  Lj\mg,  System  Identification:  Theory  for  the  User,  Prentice- 
Hall,  1987. 

[12]  M.  Massoumniaand  R.L.  Kosut,  “A  family  of  norms  for  system 
identification  problems,”  Proc.  1993  ACC,  San  Francisco,  CA, 
June  1993,  and  IEEE  Trans.  Aui.  Conir.,  vol,  39,  no.  5,  pp. 
1027-1031,  May,  1994. 

[13]  K.  PooUa,  P.  Khargonnekar,  A.  Tikku,  J.  Krause,  and  K.  Nag- 
pal,  “A  time-domain  approach  to  model  validation”,  Proc. 
1992  ACC,  Chicago,  IL,  June  1992,  and  IEEE  Trans.  Aui. 
Contr.,  vol.  39,  no.  5,  pp.  951-959,  May  1994. 

[14]  M.  G.  Safonov  and  T.  C.  Tsew,  “The  unfalsified  control  con¬ 
cept:  a  direct  path  from  experiment  to  controUer”,  presented 
at  Conference  on  Feedback  Control,  Nonlinear  Systems,  and 
Comylexity,  McGill  University,  Montreal, Canada,  May  6-7 
1994. 


10^ 


